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Introduction 


lliyana Krapova 


Svetlana Nistratova 
Luisa Ruvoletto 


Giuseppina Turano 
Università Ca' Foscari Venezia, Italia 


Balcania et Slavia. Studies in linguistics is published by the Depart- 
ment of Linguistic and Comparative Cultural Studies at Ca' Foscari 
University of Venice. The primary goal of the Journal is to bring to- 
gether original publications in the field of theoretical, areal-typologi- 
cal, contrastive, and diachronic linguistics, which can provide a better 
understanding of the structure and the history of the Slavic and Bal- 
kan languages. In the academic landscape of Italy, the Journal prom- 
ises to become a unique virtual domain of interdisciplinary research 
in which the Slavic and the Balkan linguistic areas are put to scruti- 
ny so that they can feed off each other, in both empirical and theoret- 
ical terms. As editors, our aim is to publish contributions that bring 
to light different typological, areal or structural properties of these 
languages especially if such properties have remained unnoticed in 
current traditions or even if, however well-known, they call for in- 
novative approaches in order to be better understood and analysed. 

Issues of variation and micro-variation will be particularly rele- 
vant for the overall subject matter of the Journal because we believe 
that it is through a comparative description of the variation patterns 
in the various domains of language that we can gain a deeper view 
into the comparative grammar of the languages under study. This 
goes without saying for the well-known multilinguistic and multi- 
ethnic Balkan area that, due to the tormented Balkan history made 
of migrations, population mix and failed integrational policies, pre- 
sents, as many scholars have pointed out, a unique dialectal contin- 
uum with patterns of isogloss variation that have not yet received 
the attention they merit. 
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The languages of the Balkan Linguistic Union, which comprises 
Balkan Slavic, Balkan Romance, Greek and Albanian, have come to 
develop, over the past six or seven centuries, a number of shared 
grammatical (morphosyntactic) and lexical features currently known 
as ‘Balkan linguistic types’. But, as acknowledged by many scholars, 
‘Balkan linguistic types’ are much more than a random collection of 
commonly acquired features (loan constructions, calques, syntactic 
borrowings, etc.) as it is generally the case with contact-induced inno- 
vations among two or more neighbouring dialects or languages. Bal- 
kanisms are deeply integrated into the structure of some or all Balkan 
languages, and the specific Sprachbund effects they have produced 
go beyond a simple areal explanation and raise a number of ques- 
tions that are still open (for discussion, see the collection of articles 
in Krapova, Joseph 2018): the specific role of contact in bringing about 
the convergencies, the nature of the convergence itself (grammatical 
replication, some sort of copying or borrowing of surface structures), 
the mechanisms of transfer, etc. Another aspect worth exploring is the 
relation between the Balkan Sprachbund and other well-known lin- 
guistic areas including Standard Average European (SAE) conceived 
as a linguistic unity from a broad European perspective (Haspelmath 
2001). Old questions relevant for the description of specific Balkan 
convergencies are also still open for discussion: the degree of Balkani- 
sation of each language spoken in the Balkans; the potential source(s) 
of each common phenomenon; the number and fate of each Balkan- 
ism, in historically reconstructed, as well as in ongoing contact sit- 
uations. At the current stage of Balkan linguistics, we need to know 
whether these processes are still going on, in which fields or subfields 
of grammar, and whether there are significant outcomes of these pro- 
cesses in recent times (see the collection of papers in Sobolev 2021, 
among many others not cited here for lack of space). 

The problem of shared or common linguistic types is particularly 
relevant also for the Slavic area. A traditional concern in Slavic lin- 
guistics has been the issue of divergence in diachronic terms, name- 
ly in what ways Slavic languages have evolved in space and time as 
individual linguistic entities, with some attention also paid to par- 
allel (and non-contact-induced) developments motivated by genetic 
origin. Recently, however, new perspectives have emerged and have 
enriched the traditional historic interest, namely those of conver- 
gence due to language contact, both intra-Slavic as well as between 
Slavic and the rest of Europe. There is a renewed interest toward a 
more profound comparative-typological profile of the Slavic languag- 
es in terms of their potential SAE membership, as part of the core 
or the periphery (see the recent collection of papers in Danylenko, 
Nomachi 2019). Our Journal seeks to promote original studies aim- 
ing at discussing those and similar vitally important issues that will 
undoubtedly help delineate the Slavic area and define its essential 
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characteristics in reference to the ‘other’ Europe. We intend to pub- 
lish innovative research on all topics regarding the diffusion of cer- 
tain areal properties, grammaticalization in space and time, conserv- 
ative and innovative tendencies all of which have contributed in one 
way or another to shape the current Slavic landscape. 

Balcania et Slavia is an online peer-reviewed journal, published an- 
nually in two issues and available in open access. For our inaugural 
volume, we placed a call for papers and we also invited some schol- 
ars to submit original contributions that present different methodo- 
logical approaches to different fields of Slavic and Balkan linguistics: 
phonology, morphology, syntax, pragmatics, and sociolinguistics. 

Here is a brief synopsis of the papers included in the first issue 
and appearing in alphabetical order. 

Boban Arsenijevic’s paper "No Gender in Gender Agreement: On 
Declension Classes and Gender in Serbo-croatian" looks into gender 
agreement in Serbo-Croatian (SC) as a theoretical problem for this 
language, but also for theories of how agreement mechanisms work 
in general. The problem posed by SC gender agreement data is that 
in certain cases two different gender values are triggered by the con- 
trolling noun. In the face of these potentially problematic data, the 
author seeks to disentangle the agreement feature into primitives 
like [animate] and [human] and proposes a model of how the features 
compose with each other and interact both with system features (like 
declension class) as well as with semantic ones. The most important 
and challenging property of agreement according to the author and 
his generative-driven approach to the complexities of gender mor- 
phology is that gender does not play any role in syntax but gets in- 
stantiated only at the level of logical form (LF). 

The paper by Marco Biasio "It's All Under Control! On Perfec- 
tive Present Forms in BCS Main Clauses" aims at providing a uni- 
fied analysis of the syntax-pragmatics interface of the (allegedly) 
anomalous licensing of the perfective present (henceforth Pres?*) in 
Bosnian-Croatian-Serbian present-tensed main clauses. In Old- and 
Neo-Stokavian dialects, Pres™ forms cannot refer to eventualities 
that are anchored to the utterance time. However, there seem to be 
three apparent exceptions to this structural constraint. They com- 
prise: 1) abusive metonymic performatives, i.e. a closed class of dis- 
paraging phrases that figuratively perform on the insulted hearer 
the action referred to in the utterance; 2) running commentaries; 3) 
contexts featuring the epistemic operator mozda ‘maybe’. Building on 
some theoretical tenets advocated by neoperformative hypotheses, 
it is claimed in the paper that, for Pres?" forms to be licensed in BCS 
main clauses, the uninterpretable feature of control needs to be re- 
alised as a functional projection in the SpeechActP layer at the lev- 
el of the so-called ‘Seat of Knowledge’. 
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Erzhen Khilkhanova's paper “Language Ideologies and Multilin- 
gual Practices of Post-Soviet Migrants from a Translanguaging Per- 
spective” is dedicated to translingualism, which is a popular topic 
that attracts the attention of linguists in many countries for a num- 
ber of reasons. Within the Slavic area, the increase of interest is mo- 
tivated by migration phenomena and issues of resettlement of former 
Soviet citizens in the West. The paper describes, discusses and anal- 
yses multilingual practices of non-Russian migrants from the former 
Soviet Union whose native language is Russian. The translanguaging 
perspective uncovers language ideologies underpinning these prac- 
tices and serving as the main motivation for their replacement with 
linguistic assimilation despite the rich repertoire of multilingual re- 
sources available to these first-generation migrants. 

The study also reveals a change of language ideologies of post-So- 
viet migrants in the more democratic Western European context to- 
wards awareness of language equality and increasing symbolic val- 
ue of languages for ethnic identification. 

In their paper, “Postposed Articles and DP Structures in Torlak”, 
Jelena Zivojinovic, Beatrice Azzolina and Veronica Girolami investi- 
gate the enclitic article in the DP structure of Torlak, a non-stand- 
ard Balkan Slavic variety spoken in the Southeastern area of Serbia 
and in the bordering areas of Bulgaria and Macedonia. 

Torlak displays both Balkan and non-Balkan features: like Bulgar- 
ian and Macedonian, it presents a postposed enclitic article-like el- 
ement, but, at the same time, it does not exhibit multiple determi- 
nation, which in Bulgarian and Macedonian is characterised by the 
presence of a demonstrative and one or more definite article suffix- 
es within the extended projection of the noun. 

The authors’ proposal is that, while in Macedonian and Bulgari- 
an the articles seem to have undergone a full grammaticalization in- 
to purely functional elements and for this reason they are able to oc- 
cur with a demonstrative in a multiple determination construction, 
in Torlak instead, the grammaticalization of the determiner is only 
partially resulting in an inflectional affix that maintains the demon- 
strative semantics. 

Tsvetana Dimitrova's paper “On the Diachrony of the Clitic Clus- 
ter in Bulgarian" traces back the formation of the clitic cluster in Bul- 
garian starting from the Old Church Slavonic through Middle Bul- 
garian up to the Early Modern Bulgarian and beyond. The author 
proposes that the clitic cluster is split in two layers - the main lay- 
er consists of a (pronominal) core and a (verbal) periphery, while the 
secondary layer hosts elements that are not strictly clitic but are clit- 
ic-like or semi-clitics. The author reviews a number of data in var- 
ious diachronic corpora and reaches the conclusion that there has 
been practically no change in the positions of the elements within 
the core, while the changes that have occurred have interested the 
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periphery of the cluster and are due not to structural changes but to 
changes in the set, i.e. some elements of the pronominal and the aux- 
iliary system have been reanalysed while original clitics of the dis- 
course type have been lost. 

We hope that this Journal will become an open forum for interest- 
ing discussions where researchers and experts in the field share their 
work, stimulate novel research paths and advance the study on the 
various aspects of the Slavic and the Balkan languages. 

We are grateful to the members of our Advisory Board for the in- 
ternational prestige they confer to the Journal and for their precious 
suggestions. 

Many thanks go to the Editorial Board members who actively par- 
ticipated in the entire process. 

We must also highlight the fundamental work done by our review- 
ers who are among the most reputable scholars worldwide. They en- 
sured a double-blind peer-review process - with two scholars for each 
paper - and we would like to thank them all for their efforts and time. 

Last but not least, we would like to thank our publisher Edizio- 
ni Ca' Foscari, directed by Massimiliano Vianello, for the constant 
support. Our special thanks go to Mariateresa Sala for her great ef- 
ficiency and invaluable editorial help. 
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1 Introduction 


Serbo-Croatian (SC) nouns are traditionally divided in four declen- 
sion classes and three genders (e.g. Stevanovic 1989). Declension 
classes can be designated by a pairing of the nominative and geni- 
tive singular ending, for example: class <Ø, a>, class <o/e, a>, class 
«a, e», class <Q, i». Traditionally, they are referred to by numbers 
in the given order, as class I, class II, class III and class IV declen- 
sion, which is also the way I refer to them in this article." This is il- 
lustrated in table 1 for the nouns mrav ‘ant’, Zig ‘stamp’ (to illustrate 
the effect of animacy), more ‘sea’, selo ‘village’ (-e comes after soft 
consonants, -o otherwise), riba ‘fish’ and noć ‘night’. 


Tablei Declension class and gender in Serbo-Croatian 


Class! (<@, a>) Class II (<o/e, a>) Class Ill (<a,e>) Class IV (<Ø, i>) 

Sg Pl Sg Pl Sg Pl Sg Pl 
Nom mrav-@ mrav-i mor-e(sel-o) mor-a rib-a rib-e not-d noci 
Gen mrav-a mrav-a mor-a mor-a rib-e rib-a noti |. noci 
Dat = mrav-u mrav-ima mor-u mor-ima  rib-i rib-ama noci noć-ima 
Acc mrav-(a)(zig-@) mrav-e mor-e(sel-o) mor-a rib-u rib-e not-d noci 
Inst  mrav-em mrav-ima mor-em(sel-om) mor-ima rib-om rib-ama noć-ju no¢-ima 
Loc mrav-u mrav-ima mor-u mor-ima  rib-i rib-ama noci noć-ima 


The relation between SC declension classes and gender has been the 
subject of numerous investigations.? Grammatical gender stands for 
the type of agreement a noun triggers on declinable modifiers and 
predicates, and relies on the assumption that the agreement mark- 
ers on these items correlate with the declension class of the noun, 
as well as with the natural gender of the referent, as illustrated in 
(1). Neither correlation is absolute: a noun can show an agreement 
pattern that fails to match the natural gender of the referent or the 
declension class of the controlling noun, but not both (see (1d) for a 
mismatch with the natural gender). 


1 Dueto a high degree of syncretism, the first and the second class, i.e. classes <ø, 
a» and «o, a», are sometimes taken to be one class, yielding a classification with three 
classes, one of which has two subclasses. 


2 Stevanović 1989; Corbett 1982; Corbett 1991; Wechsler, Zlatić 2003; Despić 2017; 
Alsina, Arsenijević 2012a; 2012b; Arsenijevic, Gračanin Yuksek 2016; Puškar 2017 
among others. 
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(1) 


a. bel-a krava b. bel-i bik c. bel-o tele d. mlad-a gospoda 
white-F cow white-MAsc bull white-N calf young-F gentlemen 
‘white cow’ ‘white cow’ ‘white calf’ ‘young gentlemen’ 


Inanimate nouns are orthogonal to gender, and their agreement is 
fully predictable from their declension class: class I nouns trigger 
masculine agreement, class II nouns neuter, class III and IV nouns 
feminine. Furthermore, considering animate nouns, class I includes 
only masculine members (i.e. only nouns that denote male or gender- 
unspecified individuals), class II only neuter (i.e. those referring to 
offspring, which is arguably conceptualised as genderless), class IV 
takes no animates at all.* This all suggests that gender could be elim- 
inated from the system, by modelling agreement purely in terms of 
declension class: masculine gender agreement is class I agreement, 
neuter is class II, feminine class III and IV. The ground for relying 
on gender comes from the so-called hybrid gender agreement nouns, 
such as class III nouns triggering masculine agreement in singular 
as in (2), where the agreement in the singular is not the one indicat- 
ed by the declension class, but arguably associated with the gender 
of the referents (Corbett 1991; Aronoff 1994). 


(2) 

moj-@  ded-a prv-i komsij-a mlad-@ delij-a 
my-MASC grandpa-ill.NOM.sG first-MAsc neighbour-III.NOM.sG young-MAsc dude-Ill.NOM.SG 
*my grandpa' ‘neighbour next door’ ‘young dude’ 

moj-e ded-e prv-e komSij-e mlad-e delij-e 


my-FEM.PL grandpa-lll.NOM.PL first-MASC.PL neighbour-III.NOM.PL young-MASC.PL dude-Ill.NOM.PLG 
*my grandpas' ‘neighbours next door’ ‘young dudes’ 


This step, however, introduces the question of the division of work 
between the two sources of information. Is agreement determined 
by the declension class, unless it clashes with gender, or is it deter- 
mined by gender, unless a noun lacks it - in which case declension 
class jumps in? Or do actually all nouns, animate or not, have a gen- 
der feature, which remains uninterpreted in inanimates? 

Before going deeper with the discussion, let me address some ap- 
parent exceptions to the generalisations above about the gender - de- 


3 Prescriptive grammars mention nouns kci ‘daughter’ and mati ‘mother’ as class IV 
animates but, in most present day spoken varieties of SC, these nouns have migrated 
to class III, with a more or less changed shape. 
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clension class mapping, pointed by an anonymous reviewer. One pat- 
tern is that exhibited by two types of nouns which show endings that 
formally may be attributed to class II, but trigger the traditional mas- 
culine agreement. The first group are animate nouns, including prop- 
er names and derived nouns (mostly with the suffix -ko for expressive 
nouns denoting property bearers), as illustrated in (3). 


(3) 

a. Jan-k-o b. Per-o c. prljav-k-o d. rumen-k-o 
Jan-SUFF-?.NOM.SG Per-?.NOM.SG dirty-SUFF-?.NOM.SG  ruddy-SUFF-?.NOM.SG 
‘Janko’ ‘Pero’ ‘dirty little one’ ‘ruddy little one’ 
Jan-k-a Per-a prljav-k-a rumen-k-a 
Jan-SUFF-?.ACC.SG Per-?.ACC.SG dirty-SUFF-?.ACC.SG ruddy-SUFF-?2.ACC.SG 
‘Janko’ ‘Pero’ ‘dirty little one’ ‘ruddy little one’ 


In spite of sharing some endings with class II, these nouns rather be- 
long to class I, since their accusative case in singular takes the end- 
ing as in class I rather than being syncretic with the nominative - as 
characteristic of class II nouns (see table 1). The ending -o in the nom- 
inative singular can be explained on a phonological ground, in the in- 
teraction of prosody and stem-final consonant clusters. In favour of 
the view that these nouns have an exceptional nominative singular, 
but otherwise belong to class I, speaks also the fact that for a subset 
of these nouns (those like the name Pero or the common noun medo 
‘little bear’) in some varieties, people share the nominative singular 
in -o, but otherwise decline them in class III. 

Two properties of the nouns in (3), however, point in the direc- 
tion of class II. One is that they cannot have plural forms, which is 
a property they share with uncontroversial animate class II nouns 
(Arsenijević 2017). The other is that they have a hypocoristic, hence 
expressive meaning and, as discussed in § 5, expressive meanings 
are a type of markedness that triggers the assignment of declension 
classes of a higher degree of markedness (especially class II and III). 
The difference between the nominative and the accusative (not attest- 
ed in uncontroversial class II animates) would then be an argument 
in favour of a realisational nature of declension classes. Still, this 
would not be in clash with the main thesis of this article: that gram- 
matical gender is a derivative of declension classes and not vice ver- 
sa. If declension classes too are derivatives of lexical semantic prop- 
erties of the bases (i.e. of roots or complex structures), in particular 
properties of quantity, natural gender and expressiveness, this is a 
further step in theoretical reduction. 
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The second group are borrowed nouns ending in -o or -e, some of 
which are inanimate and therefore cannot be tested for the accu- 
sative singular, as in (4a-b) (in inanimate class I nouns, this form is 
syncretic with the nominative, just like in class II nouns). Still, look- 
ing at the animate among these nouns, as in (4c-d), they do show the 
class I pattern in the accusative (that they belong to the same class 
is confirmed by sharing the same pattern with accented final sylla- 
bles, as in (4b, d); moreover, accented final syllables are not attested 
in class II). Furthermore, for most speakers, the nominative plural 
ending for all these nouns is the one of class I. While there are some 
speakers that prefer the class II ending, importantly, most of them 
also prefer the neuter agreement for these nouns, indicating that in 
their grammar they belong to class II. 


(4) 

a. radi-o b. kabar-é c. impresari-o d. krupij-é 
radio-?.NOM.sG cabaret-?.NOM.SG impresario-?.NOM.SG croupier-?.NOM.SG 
‘radio’ ‘cabaret’ ‘impresario’ ‘croupier’ 
radi-a kabar-è impresari-a krupijè -a` 
radio-?.ACC.SG cabaret-?.ACC.SG impresario-?.ACC.SG croupier-?.ACC.SG 
‘radio’ ‘cabaret’ ‘impresario’ ‘croupier’ 
radi-i kabaré -i impresari-i krupijè -i 
radio-?.NOM.PL cabaret-?.NOM.PL impresario-?.NOM.PL croupier-?.NOM.PL 
‘radio’ ‘cabaret’ ‘impresario’ ‘croupier’ 


* Nounsofthis type with an accented final syllable preserve this syllable as part of 


the stem in declension. 


For speakers who agree them in masculine gender, all these nouns 
are hence class I nouns with exceptional endings in the nominative 
singular - conditioned by phonology and for some of them also the 
shape in the donor language, rather than being members of class II. 

The other seemingly exceptional pattern pointed out by the review- 
er are class II and IV nouns used for animate individuals, as illustrat- 
ed in (5). Arguably, however, these nouns do not have animate denota- 
tion. They denote abstract notions, likely tokens of properties. Their 
animate use is either metaphorical, i.e. metonymic, or idiomatic - in 
both cases irrelevant for the productive properties of the declension 
classes (Moltmann 2004; Villalba 2009; Arsenijevic 2012), and the 
use for animates, or humans (if at all they ever denote animate in- 
dividuals, since for instance none of these nouns can be used refer- 
entially, e.g. as an answer to the question Who told you that? - see 
Arsenijevic et al 2021 for a discussion and experimental support re- 
garding this kind of non-referential nouns in class III). 
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(5) 

a. lik-n-ost-© b. velik-an-stv-o c. Cud-ov-ist-e 
person-ADJ-N-IV.NOM.SG — great-ADJ-N-IILNOM.SG — marvel-ADJ-AUGM.N-II.NOM.SG 
‘personality’ ‘majesty’ ‘monster’ 


Back to the issue of gender agreement, different analyses have been 
proposed for it - whether specifically for SC, or for other Slavic lan- 
guages (such as variation in patterns of agreement available regard- 
ing the mapping between gender and declension classes is limited, 
analyses for one Slavic language with declension are quite straight- 
forwardly adapted for another). Corbett (1991), Aronoff (1994), 
Wechsler and Zlati¢ (2003) among others postulate two independ- 
ent features for the two sources of information: each noun is speci- 
fied for both gender and declension class (where the former may be 
even further split), and grammatical operations and hierarchies de- 
cide about agreement. Others, like Müller (2004), Puškar (2017) or 
Caha (2021) propose to fully discard the declension classes as syntac- 
tic features or diacritics, and derive them from other features. While 
Miller (2004) does not identify the particular features using formal 
labels instead (F,, F,...), Puškar (2017) indicates them to centrally in- 
volve gender, and Caha (2021) explicates a set of four relevant fea- 
tures: class, feminine, individuated, and a fully unmarked case fea- 
ture. A significant advantage of the latter type of accounts is that they 
can explain regularities about intra- and interdeclension syncretism. 

Alternative approaches, such as Hachem (2015), Fassi Fehri (2018), 
Arsenijevic (2017), argue that declension classes as well as agree- 
ment reflect properties of quantity (which include animacy and gen- 
der as markers of a high degree of individuation, see e.g. Matasovic 
2004), rather than gender alone. Particularly relevant for the present 
paper is Corbett's (1982) point that grammatical gender can be de- 
rived from the declension class. 

The central goal of this paper is to develop an explicit reduction- 
ist analysis of the relation between gender and properties of quanti- 
ty on the one hand, and declension classes on the other, where it is 
the declension class that is represented by a feature, with gender ef- 
fects deriving from it, rather than the other way around as previous- 
ly argued. Quantitative as well as theoretical arguments are provid- 
ed in favour of the proposed view. 

The paper is organised as follows. $ 2 outlines the main tendencies 
of the previous accounts for declension classes in SC and other Slav- 
ic languages. 8 3 discusses the desiderata for an analysis and where 
previous analyses failed to achieve them. 8 4 lays out the quantita- 
tive data obtained from an annotated database of the 4,718 most fre- 
quent nouns in SC. 8 5 presents a novel analysis on which declension 
class is lexically assigned and occurs in syntax as a feature, and ef- 
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fects on gender and properties of quantity associated with declen- 
sion classes emerge via pragmatic competition. $ 6 proposes a novel 
morphological analysis of the SC declension class system on which 
class I lacks endings all together, and is entirely realised in terms of 
syncretism with other classes. The other three classes are ordered 
according to the balance between the role played by the theme vow- 
el and case endings, and their tendency to be realised overtly. 


2 Declension Class in Terms of Realisation 


Realisational approaches to morphology, such as Distributed Mor- 
phology (Halle, Marantz 1993) and Nanosyntax (Starke 2010), take 
syntax to operate on features (and roots) only, and resolves the prob- 
lems of morphology in an interaction between syntactic structures, 
the lexicon and the phonology, at their interface. Hence, instead of 
postulating both gender and declension class as two separate fea- 
tures which enter grammatical interactions, these approaches offer 
an alternative where declension class is a matter of realisation sensi- 
tive to gender. Indeed, Puškar (2017) indicates that declension class is 
best modelled in terms of gender-sensitive realisation - unfortunately 
without developing a concrete proposal. Privizentseva (2020) offers 
a more concrete analysis for Russian - but she simply stipulates an 
opaque formal feature which together with the feature of gender de- 
termines the realisation of case. The analysis hence merely formalis- 
es the view that declension class is a function of gender, but does not 
pinpoint the exact additional component that completes the specifica- 
tion of this mapping.^ The only attempt to provide a transparent (in 
the sense that concrete, independently attestable features are used) 
and explicit realisational account of declension classes in Slavic lan- 
guages is Caha (2021) with a model of Russian declension classes. In 
what follows I will give only a brief assessment of his analysis, try- 
ing to minimise the unavoidable injustice to the presented work en- 
forced by spatial limitations. 

Caha outlines the entire path of reasoning from having the rele- 
vant functional features realised by roots (or more broadly nominal 
stems), via a view where the roots realise a relatively small function- 
al sequence, while the features relevant for declension classes are 
realised by the very endings, to a hybrid, where in some case forms 
the root realises a long span, while in others it shrinks and the end- 


4 Similar in spirit is Müller's analysis (2004), where two opaque features are intro- 
duced to capture the syncretism between Russian declension classes. However, this 
analysis has no ambition to derive declension class from gender via realisation, but rath- 
er considers the two features that determine it independently present in the structure. 
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ing realises the relevant projections. 

For illustration, the singular case forms of the Russian noun zavod 
'factory' receive realisations as in (6) (Caha's (101)). In the nomina- 
tive and the accusative form, the root realises the lexical content, the 
features REF(ERENCE), CLASS, IND(IVIDUATED), and the formal features 
combining to determine the particular cases. In the remaining case 
forms, the root shrinks, and the ending realises the features CLASS, 
IND, next to the relevant case features. 


(6) Re-elaboration of Caha's analysis of zavod ‘factory’ 


F4 |F5 |F6 


The costs the analysis pays for its great reductionist capacity include 
treating all the members of one of the two bigger declension classes 
(the class in -a in the nominative singular) as idioms where the root 
is contextually specified to only occur in structures which include 
the feature FEM(ININE). Moreover, all these nouns involve mismatch 
between the LF and the PF, in the sense that their meaning matches 
the entire idiom, and their phonological realisation only the struc- 
ture below FEM. Furthermore, as Caha explicitly admits, the feature 
FEM in his account has nothing to do with the feminine natural gen- 
der of the referent (otherwise it would not only make all the inani- 
mate nouns in the class interpreted as female, but also those with 
animate male referents such as d'ad'a ‘uncle’ or the name Ilya). The 
natural gender is an independent factor, which is not syntactically 
represented, and competes for triggering agreement with the syn- 
tactic feature FEM. Recall that animate nouns with mismatches be- 
tween the declension class and gender are the reason for postulat- 
ing gender in grammar, in addition to declension classes. Once the 
feature is dissociated from the semantics of gender, the analysis is 
not transparent any more (i.e. the feature is no different from Priv- 
izintseva's 2020 additional opaque feature). 

Caha considers different alternative analytic strategies, pointing 
out various problems that they face, and indicating that a realisational 
analysis of Russian declension classes superior to the one he proposes 
is not possible considering the facts. Yet it is questionable whether the 
sacrifices that his analysis makes are worth the goal which is achieved. 
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3 Basics for a Better Model of Declension Classes 


The desiderata stated in $ 1 include reduction and transparency. Re- 
duction is achieved if one of the two phenomena, declension class or 
gender, is derived from the other. Transparency is achieved if the 
phenomenon that serves as the basis for the derivation of the oth- 
er one is independently attested (i.e. in other domains of syntax, or 
in semantics or phonology). Transparency thus further contributes 
to reduction. 

Gender is independently attestable in semantics, but only for an- 
imate nouns. Declension classes are attested in morphology, in the 
endings which different nouns take for the same case form (i.e. in the 
same syntactic contexts). A logical step for realisational approach- 
es is to have the former represented as a syntactic feature (because 
it is LF interpretable), and derive the latter from it via some realisa- 
tion rules (as realisational approaches generally do with morpholog- 
ical realisations). 

For transparency, this relation needs to be total and systematic. 
There should be a set of features such that each declension class is 
characterised by a unique subset of those features. All the members 
of a declension class then should share this exact specification - they 
should not lack any of the features from the combination, nor bear 
any additional ones from the set. 

In this paragraph, I consider the likely candidates for semantic dif- 
ferences and, based on a corpus investigation, argue that while ten- 
dencies are clearly observable, no sufficiently strong contrast can be 
identified. This leads to the conclusion that the declension class must 
be a syntactic primitive, i.e. in feature-based approaches: a feature or 
a combination of formal features whose direct interpretation is the 
specification of the set of endings realising case forms of the noun. 
To achieve reduction, then, the only option is to derive gender from 
declension classes, which is the path I take in $ 2. 

Another important observation, which previous analyses overlook, 
is that what is in the literature referred to as (grammatical) gen- 
der (as discussed above regarding Caha's view) is effectively also a 
choice between declension classes. As already outlined in 8 1, gram- 
matical gender is identified by the agreement patterns the noun trig- 
gers. More concretely, it is identified by the set of endings the agree- 
ing item takes. There are altogether three such sets of endings, one 
that is described as masculine, one as feminine and the third as neu- 
ter. But not only, as pointed out in the discussion of Caha, these end- 
ings establish a correlation with gender in animate nouns - they al- 
so fit the definition of declension classes. The only difference is that 
agreeing items are lexical categories that are not restricted to one 
declension class - they can combine with all three of them. In oth- 
er words, what is referred to as grammatical gender is a declen- 
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sion class too: masculine is the adjectival declension class I, neuter 
is class II and feminine class III (this is also reflected in the shared 
endings between nominal and adjectival declension classes or mor- 
phological components thereof). Approaches which derive declension 
classes from gender thus effectively derive the declension class of 
the controller from the declension class of the agreeing item. There 
is something paradoxical about it. 

One big problem for predicting the declension class from the gen- 
der feature alone is quite obvious: declension classes III and IV are 
both described as bearing feminine gender (i.e. they trigger exactly 
the same agreement patterns). Only based on gender, it is hence im- 
possible to predict which noun will realise its case by class III and 
which by class IV endings. 

Obvious candidates for the features that conspire with declension 
class to determine agreement are those that Caha (2021) employs in 
his model. Caha's final account however does not make use of all the 
four features it includes - rather, features CLASS, FEM and IND always 
remain together, and may for the intents and purposes of his analy- 
sis be replaced by only one feature - or it could be only one of them 
that plays the relevant role. 

Feature REF is supposed to be common for all nouns, or at least 
to have the same status in all of them (Baker 2003). Feature FEM ba- 
sically stipulates agreement and effectively splits the set of declen- 
sion classes in two (the masculine and the neuter declension class 
on the one hand, and the two feminine declension classes on the oth- 
er). Features CLASS and IND, which are closely related to properties 
of quantity (such as count, mass, atomic), should then be responsible 
for distinguishing at least between the former and the latter two. As 
two features have the potential of specifying four different classes 
(absence of both, only cLAs, only IND, both CLASS and IND), the ques- 
tion emerges whether the remaining two features are needed at all. 

Works like Hachem 2015, Fahri Fessi 2018 and Arsenijevic 2017 
follow exactly this path: they argue that the unit of counting (i.e. pos- 
sibly its absence) and the type and degree of individuation are the 
properties behind not just declension classes, but actually also what 
is traditionally seen as gender. In this family of views, it is the finer di- 
vision along these dimensions that plays a role: rather than reflecting 
mass vs. count denotations or a mere degree of individuation, the rel- 
evant features reflect particular kinds of mass, or different degrees 
of individuation. This view takes differences between non-atomised 
masses (space, time), atomised masses (sand, powder), collective de- 
notations (crew, clan), vague count denotations (ocean, field, plane), 
together with the opposition between abstract and concrete denota- 
tions, animate, inanimate and human, or events, properties and in- 
dividuals, as the base for the emergence of declension classes. Natu- 
ral gender then presents only a fraction of the set of properties which 
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conspire to yield the traditional gender agreement and the division 
of nouns into declension classes cross-linguistically. 

Arsenijević (2017) supports this view by data from SC showing 
that neuter nouns (which all also belong to declension class II) man- 
ifest a range of deficiencies regarding individuation. Examples as in 
(7) manifest that neuter agreement is not available to highly individ- 
uated plural uses such as in demonstratives used with pointing, an- 
imates generally or conjunction. 


(7) 

a. Ona *(deca) su neumorna. (pointing at children) 
those.NEUT.PL child.NEUT.PL ^ AUX.PL tireless.NEUT.PL 
‘Those children are tireless’ / *‘Those are tireless’. 


b. dugm-e dugm-eta tel-e *tel-eta 
button-NEUT.SG button-NEUT.PL calf-NEUT.SG calf-NEUT.PL 
‘button’ ‘buttons’ ‘calf? 

c. Sel-o i polj-e su potopljen-i / *potopljen-a 


village-NEUT.s6 and  field-NEUT.sG are.PL flooded-masc.PL / flooded-NEUT.PL 
‘The village and the field are flooded’. 


On this basis, Arsenijevic argues that neuter corresponds to the ab- 
sence of the classifier feature, i.e the absence of Landman's (2011) 
neat generating capacity (see also Arsenijevic 2017), which blocks 
the formation of a proper plural. He claims that the form traditional- 
ly analysed as N.PL is rather a collective plural form (a weak division 
over a messy quantity structure in Landman's sense), unable to ex- 
press highly individuated plural meanings. 

In modelling the relation between gender, properties of quanti- 
ty and declension classes, a quantitative insight into the structure 
of the nominal category in the language use is highly informative. 
Seeing what proportions of the nouns in each class are accounted by 
each of the approaches, and how many remain unaccounted for can 
give us a measure of plausibility of these accounts. This can help us 
both determine the minimal set of features needed to account for the 
facts, and the optimal direction of derivation, i.e. whether declension 
class should be derived from gender and properties of quantity, or it 
should be lexically specified, and display effects on the interpreta- 
tions in the domain of gender and properties of quantity. 
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4 Declension Classes, Gender and Properties of Quantity: 
A Quantitative Corpus-Based Analysis 


To empirically test the modelling options in the domain of declen- 
sion classes, gender and properties of quantity, I performed a cor- 
pus-based research. The material was assembled by first excerpting 
the 5,000 most frequent SC nouns from the srWaC corpus (Ljubesic, 
Klubicka 2016), and removing doubles and non-words, which reduced 
the material to a database of 4,718 nouns. The nouns were then an- 
notated for the properties in table 2 (the database is annotated for a 
significantly larger set of properties, but only the ones relevant for 
the discussion are given in the table).* 


Table2 Annotated properties of SC nouns in the database 


Freq Frequency (tokens per million words) 

Class Declension class (I-IV) 

Gram. gender What agreement the noun triggers (MASC, FEM, NEUT, HYB[rid]) 

Nat. gender Lexical semantic restriction of the sex of the referent (NMASC, 
NFEM, NNEUT, Ø) 

Anim Animate (1, 0) 

Count Does the noun inflect for plural (1, 0) 

Mass Does the noun combine with mass quantifiers like sve ‘all’ (1, 0) 

Coll Does the Sg noun allow both Sg and Pl agreement (1, 0) 

Group Does the noun in the singular denote a group of entities (1, 0) 

Name Is ita proper name for any class of individuals (1, 0) 

Suff Does the noun have at least one suffix (1, 0) 

Suff1 What is the first external (linearly last) suffix of the noun if it has any 


Additionally, the mass nouns in the database have been annotated 
for the properties in table 3. 


Table3 Additional properties annotated only for mass nouns 


Abstract Is the noun abstract (1, 0) 
Atomised Does the denotation of the noun involve a level of minimal units (1, 0) 
Atom-type If atomised, what is the type of atoms (concrete, abstract, event, vague) 


Mass-type The ontological class (spatial, temporal, structure, domain, dimension, 
complex, assembly, substance, emotion, condition, capacity, property, 
event, vague count) 


5 The database was annotated by two annotators, the author and Adisa Nanić, a mas- 
ter student and native speaker of SC. Inter-annotator disagreements were resolved by 
consulting other native speakers. 
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The database indicates significant differences between the sizes of 
declension classes. Class I and class III are the largest, with 2,259 
and 1,563 member nouns, respectively. Class II is somewhat small- 
er, with 720 nouns, and class IV is very small with only 178 mem- 
bers. Moreover, it turned out that 7196 of class II nouns are derived 
from the (predominantly deverbal) suffix -VV.je (Arsenijevic 2010; 
Simonović, Arsenijevic 2014), and 72% of class IV nouns from the de- 
adjectival suffix -ost. There are hence only 51 nouns of class IV which 
are not derived from the suffix -ost in our sample. The proportion is 
probably even more drastic in the aggregate lexicon, as the suffix is 
highly productive, thus implying a large number of hapaxes. 

Quantitative analyses have been performed for the relation of de- 

clension class with the properties Gender, Anim, Mass, as well as 
the within-mass properties Abstract, Atomised and Atom-type. The 
remaining annotated properties either did not show a significant ef- 
fect, or their effect could be subsumed under that of one or a com- 
bination of the discussed properties. 
Grammatical gender establishes a very tight match with declension 
classes. Class I systematically agrees with masculine, class II with 
neuter, class IV with feminine gender. The only locus of incomplete 
match lies in class III, which has a number of human members show- 
ing hybrid gender agreement: masculine in singular and feminine 
in plural (see (2)). In the sample of 4,718 nouns, with 1,563 class III 
nouns, 46 such nouns are identified (0.9796 of the database and 2.9496 
of class III nouns). Grammatical gender is hence a very good predic- 
tor of declension class, and assuming that each gender value is rep- 
resented as a different feature (combination), a realisational account 
based on it covers a very large portion of the data. In order to cover 
the small subset of class III nouns that trigger hybrid agreement, one 
first needs to see whether there is a feature that all of them share. Of 
all the properties annotated in the database, two are shared by all 
these nouns: they are all animate, and none of them is specified for 
feminine natural gender (i.e. 14 are male proper names, 8 are nouns 
denoting male roles such as čika ‘uncle’ or papa ‘pope’, and the re- 
maining 24 are gender-general, like sudija ‘judge’ or arhitekta 'archi- 
tect’). A realisational account is hence possible where +GMASC (i.e. 
grammatical masculine gender) triggers class I realisation, except 
in the context +GMASC +HUM-NFEM +SG (grammatical masculine, hu- 
man, non-natural-feminine, singular), which triggers class III reali- 
sation. Inversely, grammatical gender can be derived from the com- 
bination of the declension class and the remaining features above 
(+HUM-NFEM +SG). 

An additional problem for a realisational account of the declen- 
sion class is that not just a majority of class III forms trigger fem- 
inine agreement, but also all the nouns in class IV. Hence, a noun 
specified for +G _ FEM is underspecified between class III and class 
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IV realisations. Yet, as the discussion of other properties in a quan- 
titative perspective below shows, class III is open for all the values 
of all the considered properties, which means that there is no prop- 
erty which could be represented as a distinctive feature or a distinc- 
tive set of contexts for the two declension classes. Note that, if de- 
clension class is specified and gender agreement is realisational, no 
ambiguity arises as feminine agreement can be specified to be real- 
ised in both types of contexts: class IV and class III with the excep- 
tion of III +HUM-NFEM+SG, where it is the masculine agreement that 
gets realised. Recalling further that the actual substance of gender 
agreement is declension class assigned to the agreeing item, it gets 
even more plausible to consider declension class of the noun as the 
primitive, rather than gender agreement. 

Natural gender posed a problem for annotation, in there being differ- 
ent strengths of gender presuppositions. Among those nouns that 
qualify for it (all the animate nouns), some have a relatively weak 
presupposition, others moderate and yet others a strong one. Any 
test that was tried faced this issue, including the one eventually im- 
plemented, illustrated in (9). 


(8) 

a. ženski jaguar b. ženski sudija c. ?Zenski papa d. #Zenski deda 
female jaguar! female judge. female pope. female grandpa.ii 
‘female jaguar? ‘female judge’ ‘female pope’ ‘female grandpa' 

e. Zenka-jaguar f. Zena-sudija g. ?Zena-papa h. #Zena-deda 
female-jaguar. woman-judge.ill woman-pope.IlI woman-grandpa.ill 
‘female-jaguar’ ‘woman-judge’ ‘woman-pope’ ‘woman-grandpa’ 


The problem for annotation actually only reflects the fact that the 
natural gender of a noun is not a discrete value, as it is strongly in- 
fluenced by cultural constructs. Whether an animal will be prototyp- 
ically represented as a male or a female, how immanent a property 
of a pope it is that it is a male, and how immanent it is for a grand- 
pa - are not questions that render clear-cut categories. Generally, the 
degradation is stronger for humans than for animals, and for nouns 
with an established minimal pair in gender than for those without 
it. It is also typically stronger in singular than in plural. In the an- 
notation, therefore, only human nouns were annotated for natural 
gender,* and it was specified for any noun that causes degradation 


6 Very few animal-denoting nouns caused degradation in combination with the ad- 
jective muski ‘male’ (svinja ‘pig’, krava ‘cow’, ovca ‘sheep’, koza ‘goat’, macka ‘cat’), i.e. 
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when combined with the adjective muski ‘male’, i.e. in a (semi-)com- 
pound with muskarac ‘man’ (as +NFEM) or Zenski ‘female’, i.e. Zena 
‘woman’ (as +NMASC), irrespective of the strength of degradation. 

Even though in the annotation the mapping of this property onto 
discrete values was forced, looking only at the singular forms, the 
insights above suggest that natural gender cannot be a grammatical 
feature, as grammatical features are stable discrete formal objects, 
rather than continuous scalar ones, which moreover change depend- 
ing on other features (number in this case). 

There are two more reasons why natural gender is unlikely to be 
a feature. One is that it often emerges as a consequence of the exist- 
ence of a minimal pair of lexical items specialised for the opposite 
genders (see also Merchant 2014; Sudo, Spathas 2019). This indicates 
that lexical competition in pragmatics is a possible source of natural 
gender, even if the lexical semantics does not necessarily include it 
(this is supported by the effect of number, as in plural, gender-gen- 
eral interpretation figures more prominently in the pragmatic com- 
petition, see also Puškar 2018; Mitić, Arsenijević 2019). 

The other reason is the quantity of nouns carrying natural gen- 
der. In the entire database, 192 nouns are annotated as restricted 
regarding natural gender (139 to male and 53 to female referents). 
This makes 4.0796 of the database. It is not just unusual, but also un- 
economic that a system employs a feature marked for either of its 
two values on such a small fragment of the targeted lexical category. 
Animate nouns are most frequent within class I, with 32.6496. Class 
III is second with 10.7196 of animates. In classes II and IV, animate 
nouns have a marginal presence with 5 (0.6996) and 4 animate mem- 
bers (2.2596), respectively. Even among these nine members, one in 
class II and two in class IV are collective nouns that are not tradition- 
ally considered proper animates. Moreover, as shown in Arsenijević 
(2017), even animate diminutives, which would add a couple of dozens 
of lower frequency animates to class II if the sample were expanded, 
all show a defective behaviour, manifested for example in failing to 
inflect for plural. Assuming that the seven exceptions are idiomatical- 
ly stored, declension classes can be divided in two carrying or real- 
ising a -ANIM feature (class II and IV, assuming the animates are lex- 
ically stored as idioms) and two without it (class I and IIT). 

All together there are 542 count animate common nouns and 377 
personal names in the database - in sum 919 count animate nouns, 
i.e. 19.4896 of the sample. Only these nouns are candidates to car- 
ry natural gender features, as they can only be interpreted on ani- 
mates. In the remaining over 8096 of nouns, gender features may only 


Zenski ‘female’ (konj ‘horse’, pas 'dog'). All of them have specialised nouns for the gen- 
der denoted by the adjective used for testing. 
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occur as purely formal. Nothing predicts their presence or absence, 
and they basically stipulate the agreement patterns, i.e. the declen- 
sion classes of the agreeing items. 

Finally, there is also a number of animate nouns controlling fem- 
inine agreement, which do not trigger female, and even tend to trig- 
ger male presuppositions, such as pijanica 'drunkard', budala ‘fool’, 
spavalica 'sleeper'. An approach stipulating the feature FEM on these 
nouns would correctly predict their agreement, but would incorpo- 
rate a problematic property, namely that an animate noun carrying 
feature FEM triggers male presuppositions. Even worse, hybrid nouns 
like komsija ‘neighbour’, gazda ‘master’, vladika ‘bishop’ would need 
to be specified for not bearing the feature FEM in the singular, but 
receiving it in the plural, while triggering male presuppositions in 
both sets of forms." 


(9) 

a. teSk-a pijanic-a b. nas-@ komáij-a c. na&-e kom$ij-e 
heavy-FEM drunkard-i! — our-MAsc neighbour- — our-FEM.PL neighbour-IIl.PL 
‘heavy alcoholic ‘our neighbour *our neighbours' 


Mass nouns in the broad sense determined by our test make up 
81,6396 of class IV and 72,6796 of class II nouns. Among class I and 
class III nouns, they are represented with only 7,1696 and 26,2196, 
respectively. 

In class II, this is mostly due to the productivity of the suffix -VV.je 
and the abstract mass individual-to-property denominal suffix -stvo 
(9,3296), as these suffixes derive almost only mass nouns. 

Due to the productivity of the suffix -ost, the percentage for class 
IV is expected to significantly increase when lower frequency nouns 
are included. 

However, the optimism from the sharp contrasts is relativised by 
the fact that still a quarter of class III and 162 nouns from class I 
have mass denotations and would need to be idiomatically stored in 
case mass semantics were used as a feature that takes part in deriv- 
ing the declension class. 

Collective nouns, which I take in the narrow SC-specific sense of 
nouns that denote groups of entities and allow both singular and 
plural number agreement, are generally rare - no class reaches 196 
of collective members. Still, differences indicate a clear split in the 


7 One may, of course, always introduce an additional gender feature, such that one 
is responsible for agreement, the other for the interpretation (see e.g. PuSkar 2017), 
but this is a costly methodological choice - especially considering that an additional 
account is still needed for how these two features determine the realisation of the in- 
flection (i.e. the declension class). 
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tendencies. It seems that class I is closed for collectives: even though 
it is the largest class, it does not have a single collective member. 
Class III has 0.1996 collective members, class IV 0,5696 and class II 
0,8396. Part of the reason is that class II and class IV have morpho- 
logical operations deriving collective nouns (suffix -VV.je with nom- 
inal bases and suffix -ad with class II nouns as bases, respectively). 

Together, mass and collective nouns indicate 1) that classes II and 
IV are much more likely to be homogeneous (i.e. cumulative and di- 
visive) in mereological terms (e.g. Krifka 1989), i.e. to have non-at- 
omised and/or messy semantics in terms of Landman (2011), 2) that 
class I has a strong tendency for quantised, i.e. neat atomised seman- 
tics, and 3) that class III is open for all types of meaning. Similar can 
be concluded from animacy, considering that animacy ranks high on 
the individuation hierarchy, as opposed to mass and collective mean- 
ings. In all these properties, class III ranks in the middle, indicating 
that it is permissive for all the observed properties. 

Let us now skip to the properties annotated for mass nouns only. 

Note that the test used to identify mass nouns is whether the noun 
combines with the mass quantifier sav ‘all’. This quantifier combines 
not only with proper mass nouns, but also with all the nouns that have 
a vague quantity interpretation. The annotation aimed to identify as 
mass only those nouns that combine with the quantifier in their dom- 
inant interpretation, i.e. without accommodating the vagueness. Still, 
the nouns identified as mass ended up including a number of count 
vaguely bounded nouns, as illustrated below - where it was hard to put 
the boundary between dominant and non-dominant interpretations. 
The annotation ofthe type of atomisation identifies this subclass, thus 
splitting the broad class of mass nouns into a big part which is quite 
reliable, and a smaller one with compromised reliability. 
The property atomised was annotated with three values: as atomised, 
when the noun denotes a homogeneous assembly of atomic units (pa- 
sulj ‘beans’, pesak ‘sand’, drustvo ‘society’) or an assembly of units 
varying in the relevant properties (divljač ‘game [wild animals]’,® 
smece ‘trash’, komsiluk ‘neighbourhood’); as count vague, when the 
noun in the singular denotes a singular entity without internal atomi- 
sation, but with vague boundaries (dah ‘breath’, svemir ‘space’, oblast 
‘area’); or as non-atomised, when the noun denotes a proper mass: 
homogeneous, unbounded, without internal atomisation (tecnost 'liq- 
uid’, komfor ‘comfort’, ambicija ‘ambition’). 

Event-denoting nouns presented an additional challenge for an- 
notation, as it is hard to properly map them onto the classification 
above. The following strategy was opted for nouns denoting states, 
and pure processes without any specification of a culmination like 


8 Thelatter two types were separated in the property type of atom. 
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Cekanje ‘waiting’, egzistencija ‘existence’, rad ‘work’ were annotated 
as narrow (i.e. non-atomised) mass. Those denoting single instanc- 
es of a rather bounded eventuality like isplata ‘payment’, podsticaj 
'encouragement', bekstvo ‘escape’, were annotated as vague, as they 
indeed have plurals and are characterised by vague boundaries like 
vague count nouns above. Finally, nouns preserving the secondary 
imperfective semantics and therefore having an iterative interpre- 
tation (ispaljivanje ‘firing’, izvrtanje ‘turning inside out’) were anno- 
tated as atomised - assuming that single iterations of the eventual- 
ity constituting potentially unbounded assemblies are equivalent to 
atoms constituting a homogenous unbounded mass. The aggregate 
distribution is given in table 4. 


Table4 The distribution of types of atomisation among mass nouns (96)? 


Class! Class Il Class III Class IV 
atomised 12,55 38,86 21,64 5,91 
vague 31,39 13,90 11,56 18,34 
mass 56,05 41,23 66,79 75,73 


Relative to other classes, class I has a higher proportion of vague 
count nouns, nouns that only marginally display mass semantics and 
essentially are weakly quantised. Regarding the remaining two op- 
tions, class I has a distribution matching the overall average ratio for 
these two atomisation values. This additionally supports the treat- 
ment of this class as highly individuated. One may wonder why there 
are non-fully-individuated nouns in this class in the first place - but 
recall that we are anyway dealing with tendencies, and that the per- 
centage of such nouns in the class was very low. Class II has a signif- 
icantly higher proportion of atomised mass nouns than other class- 
es, and class IV the opposite: a larger number of non-atomised, i.e. 
narrow mass nouns. This now provides a contrast, even if only ten- 
dential, between class II and IV. While they both have a tendency 
for mass interpretations, class IV tends to have a proper mass inter- 
pretation, and class II rather tends to be atomised. Class III is close 
to the average overall distribution again, except that it has a some- 
what lower participation of vague nouns. This confirms again that 
this class does nottake part in the competition in the domain of prop- 
erties of quantity. 

To make sure that the choices made regarding the annotation of 
event-denoting nouns did not distort the distribution, I consider also 
the distribution when event nouns are disregarded, as given in table 5. 


9 Chi = 113,9331, p < 0,00001. 
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Table5 The distribution of types of atomisation among mass nouns without event 
nouns (96) 


Class! Class Il Class III Class IV 
atomised 15,90 22,22 26,33 4,87 
vague 32,38 35,85 10,97 17,68 
mass 51,70 41,91 62,69 77,43 


Only class II shows a notable difference, in particular in the increase 
in vague count nouns at the expense of the other two types. The per- 
centage of vague count nouns has actually tripled, and makes class 
II the richest of all classes in this type of nouns. Considering that in 
class II pretty much all event-denoting members are derived by one 
suffix, the distribution without it is possibly more telling about the 
class.!° Crucially, however, the change only makes stronger the con- 
trast between class II and class IV regarding narrow mass mean- 
ings on the one hand, and the atomised interpretations on the other. 
Finally, the property abstract had the distribution in table 6. 


Table6 The distribution of abstractness among mass nouns (in %)** 
I Il I IV 


abstract 64,12556 85,09213 76,49254 89,34911 
concrete 35,87444 14,90787 23,50746 10,65089 


Once again, the grouping is into class II and IV as more abstract, in 
class I as more concrete and in class III as having average distribu- 
tion. Considering that abstractness is lower in the individuation hier- 
archy than concreteness, this too supports the view on which classes 
II and IV tend towards lower values of individuation, class I towards 
the higher end, and class III is open for various degrees of individua- 
tion. The strong contribution of the suffix -VV.je to class II has a sig- 
nificant influence regarding this property too. As the suffix almost 
only derives event nouns and events are abstract entities, without 
these nouns, this class ranks very low in abstractness (but other per- 


10 Since in derived nouns it is the suffix that determines the declension class, the en- 
tire set of nouns derived by one suffix might need to be counted as one member of the 
declension class (this is even more obvious on certain views of derivational morpholo- 
gy, e.g. Lowenstamm 2014; Simonovié 2020 for South Slavic). This principle of quanti- 
fication makes the system even more sharply divided in two small and two large class- 
es (class IV would have 41 members, class II 102, class I 1,559 and class III 699). Since, 
however, the same suffix does not necessarily assign the same properties of quantity, 
I chose counting each noun separately as the general approach, leaving an investiga- 
tion along the lines above for the future, specifically for topics where issues that more 
strictly correlate with suffixes will be in focus. 


11 Chi? = 56.7812, p < 0.00001. 
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centages would drop too, because event-denoting nouns are a signif- 
icant group in all declension classes). 

In summary, grammatical gender has a high capacity in predict- 
ing declension class, but due to the same agreement pattern of class- 
es III and IV, the capacity is higher when the relation is inverted, i.e. 
when declension class predicts gender. Natural gender helps account- 
ing for just over 496 of the data. 

Properties of quantity have a broader coverage. Class I has a very 
strong tendency for individuation: it tends to include animate nouns, 
is an unlikely class of mass nouns in the broad sense - and when it in- 
cludes them, it goes for the vague count and atomised, as well as for 
concrete, rather than non-atomised and abstract nouns. Still, this is 
a tendency and not a rule - class I includes 176 mass members, and 
94 others that are ambiguous between a strict count and a mass in- 
terpretation in the broader sense.” The significance of these quan- 
tities is strengthened by the fact that class I is syncretic with class 
IV in the most frequent forms (NomSg, AccSg, NomPl) and with class 
II in almost all the remaining ones, and that nouns not only switch 
between class I and class IV, i.e. class II, but there are some that at 
least for some speakers may be declined in two classes (bol 'pain', 
olos 'scam', polen ‘pollen’, i.e. auto ‘car’, radio ‘radio’, dinamo 'dyna- 
mo’). This latter fact might actually be offering a partial answer too: 
class IV is almost closed (see Simonovic 2020) and its members are 
gradually switching to class I, probably exactly due to a reanalysis 
based on the frequent syncretic forms. Class I is possibly taking mass 
members due to a gradual disappearing of class IV and the syncre- 
tism between them, which drives the reanalysis. 

Class I could hence in principle be modelled in terms of feature 
specification as +ATOM +NEAT (Landman 2011). For this, its 270 mem- 
bers that (also) have mass interpretations need to be treated as id- 
ioms, i.e. as class I members stored in the lexicon with a mass or 
vague interpretation in spite of the strict count feature specification 
that they bear. Already these numbers, received on a sample of 4,718 
nouns, present quite a large quantity of nouns to be stored, and the ac- 
tual numbers for an average speaker are likely five to ten times higher. 

Classes II and IV are inanimate (the 7 overall individuated animate 
nouns they include are plausibly stored), with a strong tendency for 


12 The reason why the regularities are only tendential is that they probably emerge 
as surface generalisations motivated by heterogeneous factors including language use. 
Their base is the interaction between the lexicon, grammar and pragmatics in language 
use, and they are restricted by various properties of the grammar and the lexicon (such 
as the gradualloss of class IV, the general preference to assign less marked declension 
classes, phonological properties of the stem favouring one declension class or anoth- 
er, specification of the derivational suffix for derived nouns or the shape of the noun in 
the donor language for the borrowed ones). 
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mass interpretations in the broad sense. When it comes to proper- 
ties of quantity, class IV is tendentially specialised for non-atomised 
meanings, especially for property-denoting abstract nouns, and class 
II for atomised mass, especially for event-denoting abstract nouns. 
Class II could hence be modelled as bearing the -NEAT feature, and 
class IV as bearing the -ATOM feature. Again, this requires that prop- 
er count nouns in class II and atomised nouns in class IV be repre- 
sented as idioms interpreted differently than indicated by their fea- 
ture specification. Even though the percentage of these exceptions is 
higher, due to the small size of the classes, the absolute numbers are 
similar to class I. In the sample of 4,718 nouns, this yields 197 class 
II and 50 class IV nouns. Due to a high productivity of the suffixes 
-VV.je and -ost, the percentage of exceptions is likely to fall with the 
expansion of the sample to a speakers's entire lexicon, and the abso- 
lute numbers should not grow as much as in class I, but they are still 
several times higher than those attested. 

The fatal hit for a realisational approach based on the properties 
of quantity comes from class III, which is open for all types of inter- 
pretations. If modelled in terms of features, this class can be charac- 
terised as lacking any of the relevant features, i.e. as having them all, 
and being realised based on the superset, i.e. subset principle. The 
problem is that there is no noun for which this class does not com- 
pete with at least one other class which is more specific and hence 
must win as the realisation of the declension class of the noun. The 
only way for this class to ever win is that its feature specification is 
irrelevant (hence probably absent), and all its members are stored 
as idioms (similar to Caha's 2021 proposal). Effectively, this means 
that, by extremely conservative means, at least 4096 of the nouns in 
the lexicon are effectively assigned a declension class in whatever 
storage the framework postulates for idiomatic nouns, and at most 
6096 receive their declension class via realisation based on their fea- 
ture specification. It is hard to argue that a model employing both 
realisation and an equivalent of lexical specification is better than 
one which entirely relies on the lexical assignment of the declension 
class as a feature. In the next paragraph, I propose an alternative 
model, in which declension class is a feature assigned in the lexicon, 
weakly depending on a range of semantic properties, some of which 
are independently represented in terms of other features (such as AT- 
OM or NEAT), others being a matter of LF interpretation (such as gen- 
der). Tendential regularities discussed in this paragraph then must 
represent surface generalisations constituting the operation of de- 
clension class assignment, which has independently been argued to 
be sensitive to various often clashing factors (segmental phonology, 
prosody, properties of quantity, natural gender, borrowed suffix), see 
e.g. Nesset 1994. A tentative analysis would have all these interac- 
tions captured simply in terms of markedness: more marked values 
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of one feature (declension class) tend to match with more or with less 
marked values of other features. 


5 The Proposal: Nouns are Specified for Declension Classes, 
Interpretive Effects Emerge in Pragmatic Competition 


I have so far provided 4 major arguments against a realisational 
treatment of declension classes: 

1. gender agreement is effectively the declension class displayed 
by the agreeing item, and hence the grammatical gender is 
effectively also a declension class specification on an item 
which can be specified for any of the three agreeing declen- 
sion classes available; 

2. the mapping from grammatical gender to declension class- 
es is not fully defined, since feminine gender may map on- 
to class III as well as class IV; the mapping from declension 
classes to gender (i.e. from the declension class on the noun 
to the declension class on the agreeing item) does not suffer 
from this problem; 

3. grammatical gender needs the information about the natural 
gender to predict the declension class of hybrid gender agree- 
ment nouns, but natural gender is not a likely syntactic fea- 
ture (it is not discrete, and the corresponding meaning may 
emerge via pragmatic competition); 

4. properties of quantity also have a power to predict a signifi- 
cant portion of declension classes, but also with a non-mar- 
ginal set of exceptions and with a particular problem to cap- 
ture class III and its diachronic stability. 


Instead, I argue, declension classes are better represented as fea- 
tures in the nominal projection forming a markedness hierarchy as 
follows: IV < I < II < III.? Agreement then unfolds in the following 
way, which is similar in spirit to the analysis proposed by Kucerova 
(2018), but with declension classes as the basic feature and with a 
more conventional rescue strategy (she postulates look-ahead outside 
of the language faculty, into the conceptual-intentional system). Syn- 
tax first determines the syntactic domain for the controller of agree- 
ment. Then, it inserts copies of a declension class value from this do- 
main onto the agreeing item (as no copy can be inserted for class IV, 


13 The formality of representation may be further increased by a Nanosyntax style 
of representation, where, for instance, class I is represented as just the class feature 
in the respective projection, class II as the absence of this projection (see Arsenijević 
2017), class IV as the class feature and a mass feature adjacent to each other, and class 
III as two consecutive class projections. 
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this class resorts to class III as the most marked). Ideally, it copies 
the features of the projecting noun, but there are alternative options 
too which I do not discuss in this paper (but see for instance Willer 
Gold et al. 2016 and Arsenijevic et al. 2021). The structure is even- 
tually spelled out and interpreted at PF and LF. In contexts in which 
the declension class is interpretable, it triggers presuppositions at 
LF. When these presuppositions clash with stronger presuppositions 
that are active in the discourse, the derivation crashes. Upon a crash, 
syntax may replace the declension class with the default and spell it 
out again. If no clash emerges, the derivation converges. Let me il- 
lustrate this on a pair of examples. 

In both sentences in (10), the subject is a human-denoting nomi- 
nal expression from class III. Therefore, this class is copied onto the 
agreeing item, and it is spelled out. Feature HUMAN causes the de- 
clension class feature to trigger gender presuppositions at Lr. The 
gender presupposition triggered by class III is female. Here, the pre- 
supposition is not the content of the declension class, nor a proper- 
ty assigned to it - but rather a pragmatic mechanism relying on the 
procedure of declension class assignment in the lexicon, which in- 
cludes natural gender as a parameter. When a declension class shows 
a strong tendency regarding some natural gender, pragmatics takes 
the occurrence of this declension class in agreement as a source of 
weak presupposition of the respective natural gender (similar holds 
for the properties of quantity). In (10a), this presupposition clashes 
with the presupposition independently available about the referent 
(here, a strong male presupposition is triggered by the fact that the 
noun komšija stands as the unmarked member in the natural gender 
opposition with the feminative komsinica ‘female neighbour’; for var- 
ious degrees of strength of the gender presuppositions depending on 
the source, and for additional, experimental evidence for this view, 
see Arsenijevic et al. 2021).! The derivation crashes. As an attempt of 
repair, syntax now spells the structure out with the default declension 
class I on the agreeing item. This declension class triggers a male or 
gender-unspecified presupposition and the derivation converges. In 
(10b), all is the same until the structure is spelled out with the class 
III specified on the agreeing item. Here, however, the noun sestra 'sis- 
ter' triggers a strong female presupposition due to being the marked 
member in the natural gender opposition with the noun brat 'broth- 
er’. Presupposition triggered by agreement matches the independent- 
ly available strong presupposition, and the derivation converges. 


14 Inthe perspective of Distributed Morphology (Halle, Marantz 1993), the relevant 
information is whether there is a lexical realisation available for a more marked coun- 
terpart that may saliently realise the female gender. This information does not need ac- 
cess to the vocabulary as it is arguably available in the encyclopedia. 
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(10) 
a. Komši-a je spava-o-O 
neighbour-III.NOM.SG AUX.SG sleep-PCL-I.SG 


‘The neighbour was sleeping’. 


ai” al 


due vspav 
CLASS III 
N Vkomsi 
b. Sestr-a je spava-l-a 
sister-III.NOM.SG AUX.SG sleep-PCL-III.SG 


‘The sister was sleeping’. 


DEF 


ala 
class-1/Class- 
we 


cS Vspav 


CLASS III 


N vsestr 


Something similar happens when class II nouns are used in contexts 
presupposing neat quantity structures, as in conjoined singulars trig- 
gering the sum-interpretation illustrated in (11). 
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(11) 

Sel-o i polj-e su potopljen-i / *potopljen-a 
village-NEUT.sc and field-NEUT.SG are.PL flooded-MAsc.PL / flooded-NEUT.PL 
‘The village and the field are flooded’. 


Here, again, syntax first copies class II onto the participle. This trig- 
gers the presupposition of a messy quantity structure, which clashes 
with the strong presupposition of a neat quantity structure triggered 
by the conjunction of two count singulars. The derivation crashes, 
and syntax repairs it by inserting the default class I feature on the 
agreeing item. The derivation then converges. 

The correlations that declension classes display with grammatical 
gender (i.e. with declension classes on agreeing items), natural gen- 
der and properties of quantity are best captured in terms of prag- 
matic competition between the degrees of markedness of declension 
classes sensitive to the degrees of markedness of natural gender, i.e. 
of properties of quantity. There are two different domains in which 
this competition shows effects. One is the lexical assignment of de- 
clension classes to nouns, where the competition contributes to the 
selection of declension class assigned to the noun. The other is the 
assignment of declension class to the agreeing items, traditionally 
referred to as gender agreement, where the competition determines 
the presuppositions triggered in the domain of gender and/or prop- 
erties of quantity. Let me briefly assess each of the relevant cases. 

When it comes to properties of quantity, I assume the four types 
of meanings already considered in 8 4 to be ordered regarding mark- 
edness in the following way: non-atomised mass « atomised mass « 
vague count « strict count. The extreme values on the scale map on- 
to the first two declension classes: non-atomised mass to class IV, 
strict count to class I. The remaining two values: atomised mass and 
vague count, which are also quantitatively more marked (i.e. they 
form significantly smaller classes in the lexicon, see $ 4), target the 
next declension class in the hierarchy: class II. Hence, class IV tends 
to take mass non-atomised nouns, class I strict count nouns, and class 
II tends to take mass atomised and vague count nouns. Class I is more 
likely to take a vague count or an atomised mass noun than a non- 
atomised mass noun, and class IV is more likely to take an atomised 
mass noun or a vague count noun than a strict count noun. For rea- 
sons yet to be investigated, class III remains outside this mapping 
and receives members based on other properties, such as expres- 
siveness, segmental phonology, prosody. Note that it is independently 
clear that other properties play a role, from the fact that even class- 
es I, II and IV do not fully map to properties of quantity, but rather 
show a tendency. Two of those additional properties, natural gender 
and expressiveness, are discussed below. 
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To illustrate this, consider the nouns in (12). The base konj 'horse' 
denotes an animate individual, hence with properties atomised and 
neat, and sitting relatively high in the individuation hierarchy. This 
supports the assignment of class I. The base sen 'hay' denotes an at- 
omised mass, which supports the assignment of class II, and the suf- 
fix -ost introduces the nominal category for a base that denotes a 
property, supporting the assignment of class IV. Finally, it remains 
unclear on the proposed account why the suffix -b is specified for se- 
lecting class IIT, as both semantically and phonologically (i.e. to se- 
cure a vowel after the consonant cluster which it typically forms), it 
could as well take class II.!5 


(12) 

konj sen-o mlad-ost ber-b-a 
horse.1 hay-ll young-N.IV pick-N-111 
‘horse’ ‘hay’ ‘youth’ ‘harvest’ 


In agreement, as stated above, the declension class copied from the 
controller is the first option, but the default is an available option too. 
Hence, the competition is more intensive than at declension class as- 
signment, as the choice of the declension class is made anew in every 
context (a noun is assigned class once and keeps it in all its uses). In 
salient contexts, the selection of a marked declension class triggers 
the presupposition of the holding of a marked property along the re- 
spective dimension. This is the reason for the agreement quirks of 
class II nouns reported in Arsenijevic (2017), including those report- 
ed in (7), repeated as (13). Class II is the most marked class when it 
comes to properties of quantity - a class latently mapped with vague 
count and atomised mass nouns. Hence, it triggers the presupposi- 
tion of the corresponding meaning. The pointing context, animate 
plurals, as well as conjunction, imply assemblies of highly individu- 
ated entities, yielding a clash. The pointing is resolved by using the 
class I form of the demonstrative (oni ‘they.I1’) and the agreeing form 
of the predicate, the plural by using the collective plural form from 
class IV (telad 'calves.coLr), and the conjunction again by resorting 
to class I, this time on the agreeing item only. 


15 An anonymous reviewer suggests that arbitrary choices and mismatches of the 
type discussed could be a trace of diachronic processes. If it is correct that declension 
class assignment proceeds in the interaction of grammar, pragmatics and the lexicon 
inlanguage use, it is indeed plausible that current assignments are traces of some ear- 
lier stages in which they were the optimal choices - under the condition that the later 
stages of development did not render them (strongly) suboptimal. 
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(13) 

a. Ona *(deca) su neumorna. (pointing at children) 
those..PL  child.II.PL AUX.PL tireless.I.PL 
‘Those children are tireless’ / * Those are tireless’. 


b. dugm-e dugm-eta tel-e *tel-eta tel-ad 
button-11.SG — button-u.PL — calf-i.sc calf-11.PL calf-SUFF.IV.SG 
‘button’ ‘buttons’ ‘calf? 

c. Sel-o i polj-e su potopljen-i /  *potopljen-a 


village-i.sc and field-il.sG are.pL flooded-1.pL / flooded-l.PL 
‘The village and the field are flooded’. 


Natural gender is a property of animate nouns. Moreover, it only 
shows on the strictly count ones among them - as natural gender is 
a property of an individual, not of a group. Since strictly count ani- 
mates are very high on the individuation hierarchy, and since, due to 
the mapping to properties of quantity, classes II and IV do not take 
nouns so high on this hierarchy, the opposition in natural gender only 
establishes competition between classes I and III. Regarding marked- 
ness, masculine natural gender is unmarked and feminine is marked. 
The mapping is such that the value of gender characterised by high- 
er markedness maps onto the declension class that is more marked. 
Asa result, all the nouns denoting females are assigned class III, and 
a majority of nouns denoting males are assigned class I (of the 139 
natural masculine gender nouns in the database, 12 belong to class 
III, together with all the 52 natural feminine nouns). Again, class III 
turns out to be open for all the values, while class I is exceptionless 
in not taking one of the values - the feminine natural gender. 

To illustrate this, in the respective culture, the prototypical ant is 
represented as male, and the prototypical bee as female. As the only 
two classes that can take a (non-diminutive) animate noun are class 
I and III, class I wins for the ant because it is does not clash with the 
male presupposition and is the default class, and class III for the bee, 
as it does not clash with the female presupposition. The agentive suf- 
fix -ar derives the human denoting noun rudar 'miner' and, as the 
prototypical human in the culture is conceptualised as a male, the 
noun receives class I. The phonologically empty suffix -k is added to 
this noun to derive a marked minimal pair. This markedness can be 
interpreted along several lines: as a diminutive, as the opposite gen- 
der, as an expressive variant (a similar observation is made in Ca- 
ha 2021; see Potts 2007 for a discussion of expressive semantics). In 
the particular case, the functional load determines the marked gen- 
der, i.e. female interpretation as the relevant dimension. Since the 
corresponding presupposition matches class III, this is the class as- 
signed to the noun. 
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(14) 

mrav péel-a rud-ar rud-ar-k-a 
ant bee-Ill mine-er.I mine-er-K-I 
‘ant’ ‘bee’ ‘miner’ ‘she-miner’ 


In agreement too, the competition only emerges with humans and 
entities higher than that in the hierarchy. Hence, the competition is 
between classes I and III only. The more marked class, class III, trig- 
gers the presupposition of the more marked natural gender value, 
feminine (since the opposition is binary, class I favours the masculine 
gender by antipresupposition). When the declension class is copied 
in agreement, since agreement is reinterpreted for presupposition 
in each context as discussed above, a clash may emerge, which gets 
resolved by the spell out of the default declension class. 

Caha (2021, 37) generalises affectiveness as a property of male- 
denoting members of the Russian counterpart of class III. Indeed, 
expressiveness occurs to be a third dimension of markedness that 
interacts with declension classes that arguably enters a mapping of 
the above type. Assuming that nouns with expressive meanings are 
marked compared to those without them, a mapping can be estab- 
lished where the marked value of expressivity maps higher rather 
than lower on the declension-class-markedness scale. Unfortunate- 
ly, the database does not yet have a complete annotation of the nouns 
for expressivity, but of the 26 hybrid gender class III nouns in the da- 
tabase, 15 have a clear expressive meaning, ranging from the posi- 
tive (deda, deka ‘grandpa’, Cika, teca ‘uncle’, beba ‘baby’), via the am- 
bivalent (voda ‘leader’, gazda ‘master’, pristalica ‘supporter’), to the 
negatively biased ones (ubica ‘murderer’, budala ‘fool’, ustaša ‘mem- 
ber of the Croatian Nazi movement’).*® 

Consider the examples in (15). The noun Covek ‘man’ and the prop- 
er name Petar refer to men. Therefore, they are assigned declension 
class I. However, their hypocoristic forms Cova and Pera (a typical 
nickname for the name used by close persons), which also denote 
men, are assigned class III because of their expressive component. 


(15) 

Covek Cov-a Petar Per-a 

man.I man-ill Petar. Per-il 

‘man’ ‘man (hypocoristic)’ ‘Petar’ ‘Petar (hypocoristic)’ 


16 The noun ustaša in fact has also been bearing a positive expressive value for cer- 
tain speakers, but it has never been neutral. 
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Similarly, as already presented in (3) above, an expressive compo- 
nent may trigger a noun to display mixed class I/II behaviour, as illus- 
trated in table 7 below (note that in the nominative and accusative of 
class II, the endings e and o are in phonologically conditioned alterna- 
tion), where the expressive hypocoristic noun smrdljivko ‘stinky-SUFF- 
1’, unlike its neutral counterpart smrdljiv-ac-@ ‘stinky-SUFF-I’, shares 
with class II its typical nominative singular ending and the lack of 
plural forms for animate nouns, while having the accusative ending 
characteristic for class II and showing class II agreement behaviour. 


Table?  Expressivity-induced hybrid between classes and II 


Class! (<Ø, a>) Expressive noun Class Il (<o/e, a>) 

Sg Pl Sg Pl Sg PI 
NOM smrdljiv-c-@ smrdljiv-c-i smrdljiv-k-o / pile / 
GEN smrdljiv-c-a smrdljiv-c-a smrdljiv-k-a / pil-et-a / 
DAT smrdljiv-c-u smrdljiv-c-ima smrdljiv-k-u / pil-et-u / 
Acc smrdljiv-c-a smrdljiv-c-e smrdljiv-k-a / pil-e / 
INST smrdljiv-c-em  smrdljiv-cima smrdljiv-k-om  / pil-et-om / 
Loc smrdljiv-c-u smrdljiv-c-ima smrdljiv-k-u / pil-et-u / 


Declension class assignment is sensitive to other factors too, such as 
phonology (bases with final consonant clusters tend to take declen- 
sion classes with overt endings in the nominative rather than those 
with null morphology) or the shape of the noun in the donor language 
for loan nouns (nouns ending in -a in the donor language take class III 
declension, considering that there are no obstacles for such assign- 
ment, since class III is semantically flexible), but I do not discuss these 
aspects here beyond observing that an account realising declension 
class from gender is not likely to capture them in a transparent way. 

The presented view is parsimonious compared to the alternatives, 
as it only specifies nouns for one relevant feature (class) instead of at 
least two (grammatical and natural gender, let alone the role of atom- 
isation and neat vs. messy structure). It derives grammatical gender, 
i.e. the declension class on the agreeing item, from that on the con- 
troller of agreement rather than the other way around and does not 
face the problem of ambiguity of feminine agreement between class 
III and IV realisation. It is compatible with the findings regarding 
the pragmatic competition between minimal pairs (Merchant 2014; 
Sudo, Spathas 2019). Finally, it does not have to store a significant 
amount of lexical material or forms as idioms interpreted contrary 
to their feature specification or otherwise face a large number of ex- 
ceptions. The approach is similar in spirit to Kucerova's (2018), in in- 
voking a semantic evaluation of agreement, but differs both in the 
scope and in the technical implementation. 
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6 The Morphology of Declension Classes 


In this paragraph, I propose a novel morphological analysis of de- 
clension classes in SC, which supports the markedness ordering as- 
sumed in the discussion and models provided in the preceding par- 
agraphs: IV « I « II « III. The analysis capitalises on the fact that 
class I has not a single ending that is not syncretic with at least one 
other class, counting only same case and number counterparts, as 
obvious from table 1, repeated as table 8. In the singular, class I nom- 
inative and accusative are syncretic with class IV, class I genitive, 
dative/locative and instrumental with class II. In the plural, nomi- 
native is syncretic with class IV, accusative with class III, genitive 
with both classes II and III, and dative/locative and instrumental 
with classes II and IV. I take this as an indication that class I has 
no endings of its own: itis always realised by the endings from oth- 
er declension classes. 


Table8 Declension classes in Serbo-Croatian 


Class! (<ø, a>) Class II (<o/e, a>) Class Ill (<a, e>) Class IV (<Ø, i>) 
Sg Pl Sg Pl Sg Pl Sg Pl 
NOM mrav-@  mrav-i mor-e(sel-o) mor-a rib-a rib-e noć-Ø noć-i 
GEN mrav-a mrav-a mor-a mor-a rib-e rib-a noc-i noc-i 
DAT mrav-u mrav-ima mor-u mor-ima  rib-i rib-ama noci noc-ima 
ACC mrav-(a  mrav-e mor-e(sel-o) mor-a rib-u rib-e noć-Ø noć-i 
(žig -Ø) 


INST mrav-em mrav-ima mor-em(sel-om) mor-ima rib-om rib-ama  noc-ju noć-ima 


LOC mrav-u mrav-ima mor-u mor-ima  rib-i rib-ama noci no¢-ima 


Moreover, I use the same feature, DIV, to represent both the genitive 
case and the plural (see Arsenijevic 2005 for an elaboration). Gen- 
itive plural has the feature twice, once in the head realising gram- 
matical number and once in the head realising the case. 

(16) GEN sc: [p DIV[ 9] NOMP: [pI 


DIV]] GEN PL: [,, DIV [ DIV]] 


NumP NumP NumP 


This accounts for the fact that the genitive singular forms of classes 
II, III and IV are identical to their nominative plural forms. Class I 
takes the plural form of class II for reasons to be briefly assessed. It 
also explains why genitive plural forms have the form of the nomina- 
tive plural with a doubled ending (class I and III take the form from 
class II), again for reasons to be briefly discussed. 

Finally, accusative is syncretic with the nominative, except in class 
III singular, where it has its own ending. I leave aside the differen- 
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tial object marking in class I, where animates have an accusative 
ending that is not copied from the nominative cell, but rather from 
the more marked genitive. Similarly, in plural, class I takes an ac- 
cusative plural that is not identical to the nominative plural that it 
takes from class IV, plausibly due to a tendency to pick a more dis- 
tinctive marking. 

Assume further that class II has a mid vowel (V pia) as its theme 
vowel, class III simply a vowel (V), and class IV i. By default, V... is 
realised as o (as confirmed by its realisation after a vowel), but af- 
ter palatal consonants it gets realised as e. By default, V is realised 
as a. When the case ending starts with a vowel, in those forms that 
attach with a theme vowel, the theme vowel remains in the form of a 
zero vowel, and blocks the phonological interaction of the stem with 
the suffix. In some forms this is subject to variation, as illustrated 
in (17), where speakers of one grammar do and speakers of the oth- 
er do not (drop the zero theme vowel and) palatalize final velars in 
class II in front of endings with an initial front vowel. 


(17)  pazuh-ima > %pazusima / 9opazuhima (i.e. pazusima / pazuhVima) 
klupk-ima > %klupcima / %klupkima (i.e. klupcima / klupkVima) 


If all observable instances of syncretism are coded, we arrive at ta- 
ble9, where all instances of syncretism between declension class- 
es are marked by reference to the declension class and by the back- 
ground colour in the respective cell, and endings are otherwise 
labelled according to their primary feature content (theme vowels 
are indexed for the class). The table is purely descriptive, for the pur- 
pose of establishing the amount of intra- and interclass syncretism 
in each class. No theoretical significance is intended (e.g. how the 
instances of syncretism emerge is not implied), assuming without 
further evidence that all various approaches to morphology availa- 
ble can derive it one way or another. 


17 lam grateful to Marko Simonović (personal communication) for this piece of evi- 
dence, as well as that in (17). 


18 See Arsenijevic et al. (2021) for the same effect of a theme vowel without a full vo- 
calic realisation preventing the preceding consonant from undergoing palatalization 
in the verbal domain, as illustrated in (i). 

(i) 

za-jah-a-ti > za-jah-V-iv-a-ti > zajahivati / *zajaSivati 
hind-ride-TH-INF hind-ride-TH-IPF-TH-INF 


‘to mount.PRF' ‘to mount.IPF' 
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Table9 Sebo-Croatian declension class system re-analysed 


ci3sg cUsg M ciasg cirt cope — HP cul 
Th 


NOM Th, B Class4 Ø Pl Pl Class4 Th, 

GEN Th,-Acc Nom Class4 Nom Nom Nom Class3 | Nom 
DAT  Th.Pl Th,PI Class2 Th, Class2/4 Plx2 Class2 | Plx2 
Acc Dat Th,-Dat Class2 (Th, Th,Inst Th,-Class4 Class4  Th,-Inst 


INST Th,-Inst Th,Inst Class2 Th,-Inst Th,-Inst Th,-Class4 Class4  Th,-Inst 


As the table shows, class III only has one syncretic slot, genitive plu- 
ral, where it takes a class II or class IV ending.?? Class II ending is 
the default, and class IV ending is conditioned on particular phono- 
logical environments (see e.g. Simonovic, Baroni 2014). The selection 
ofthe class IV ending does not violate the hierarchy. The default end- 
ing is taken from the closest declension class. When a phonological 
context requires a different ending, the alternative is taken from the 
next closest class with a different ending. As classIis syncretic with 
class II in this form, the only remaining choice, i.e. the closest de- 
clension class with a different ending for genitive plural, is class IV. 
Class III has the lowest amount of syncretism within itself and with 
other classes, and the highest number of different own endings. Class 
IV only participates in syncretism relations as a donor. 

Other interesting generalisations include that nominative has no 
ending in any singular form, but while in classes II and III it involves 
the theme vowel ofthe class in a strong realisation (i.e. as a full vow- 
el), in class IV it even lacks this component. The istrumental case 
universally has the ending -m in the singular and -ma in the plural, 
except for the singular form of class IV, where it reduces to -u.?° Oth- 
erwise, class IV has very few endings generally, but also never (ob- 
servably) reduces the theme vowel, i.e. it generally relies more on the 
theme vowel than on endings to realise case forms. 

There is only one instance of syncretism under this analysis which 
skips one declension class in the hierarchy: the genitive plural form in 
class I which is taken from class III. However, since the accusative on 
the proposed analysis tends to be a function of the nominative rath- 


19 Ithankananonymous reviewer for reminding me ofthe fact that a number of nouns 
of this class with stem-final consonant clusters take the ending of class IV. 


20 This alternation is also observed in 1st person singular of the present tense, where 
the ending is -m, but several verbs realise the ending as -u, as well as between the ad- 
jectival and the nominal declension where in classes I and II the former takes -m and 
the latter -u in the dative/locative singular (bel-o-m zid-u ‘white-TH-DAT wall-DAT'). It is 
obviously far-fetched to make any claims, but itis thinkable that in certain contexts -m 
occurs as the default when -u realises the marked alternation. 
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er than an independent ending, this instance of syncretism is plausi- 
bly a property of the function rather than a syncretic ending skipping 
the hierarchy. For instance, it could be that the accusative poses a 
requirement for the plural number feature to be visible, which is not 
satisfied by the two neighbouring candidates, as class IV only has a 
theme vowel in this slot, which also occurs in the singular, and class 
II has a deficient plural (see Arsenijević 2017), but is satisfied by class 
III. The analysis therefore may well feed models with further decom- 
position and reduction, such as for instance Caha's (2009). 

The reason why class I avoid class II nominative in the plural is 
probably also the reason why both class I and class III take the geni- 
tive plural ending from class II. Genitive plural involves a double divi- 
sion, once in the number projection and one more time by the genitive 
case, as in (16) above. Classes I and III include highly individuated 
members (e.g. animate and human nouns), and their plural is hence 
interpreted as a neat division over the restriction. For different pos- 
sible reasons which may involve avoidance of effective higher order 
divisions or the fact that classes I and III also have members with 
messy semantics, a weaker plural is preferred in the context of its 
double implementation. This is exactly what class II plural does, as 
it is accommodated to the messy semantics of this class (recall that 
a majority of its members have messy semantics, in particular as at- 
omised masses or as vague count denotations). I leave a deeper elab- 
oration of this question for future research. 

The analysis in table 7 confirms the hierarchical ordering of de- 
clension classes as introduced in § 5: IV < I < II « III. Not only is 
this supported by the patterns of syncretism, where the given or- 
der involves no skipping (under a derivational realisation of the ac- 
cusative), but also the numbers of different endings monotonically 
decrease, and the occurrences of the theme vowel monotonically in- 
crease from left to right. 


7 Conclusion 


I argued, based on the data from SC, against both views that indepen- 
dently represent both declension classes and grammatical gender as 
features of the noun, and those that try to derive the declension class 
from the combination of grammatical gender and other features. The 
arguments that I put forth include the fact that gender is predicta- 
ble from the declension class of the noun in almost 9696 of the nouns 
(on a sample of the 4,718 most frequent nouns) while gender needs 
additional information to distinguish between the classes III and IV; 
that gender boils down to the declension class of the agreeing item 
and hence cannot be a more basic property of a noun than its own 
declension class; and that at least four families of properties corre- 
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late with declension classes (agreement, natural gender, properties 
of quantity, expressivity) but none correlates absolutely. This last ar- 
gument is supported by a detailed quantitative study of the relevant 
properties on a database of annotated SC nouns. 

An alternative model is proposed, where the declension class is 
assigned to nouns in the lexicon and get copied in agreement from 
the noun onto the agreeing item. Exceptions emerge when the cop- 
ied value triggers false presuppositions regarding the natural gen- 
der, properties of quantity or expressive content, in which case the 
default declension class I is inserted instead. 

Finally, I offered a novel analysis of the SC declension classes, in 
which the optimal arrangement to capture the patterns of syncre- 
tism and the roles of the theme vowels and case endings yields ex- 
actly the hierarchy of markedness postulated in the analysis of the 
mapping of declension classes to natural gender and properties of 
quantity: IV < I < II < IIT. 
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1 Introduction 


It is widely known that both in the Old- and the Neo-Stokavian dia- 
lectal groups of the BCS‘ continuum, there is a pervasive grammat- 
ical constraint on the licensing of independent perfective present 
forms (henceforth Pres”) that refer to different kinds of eventuali- 
ties in certain syntactic environments, such as present-tensed main 
clauses.? The unavailability of Pres* forms in BCS main clauses is ex- 
emplified in (1) below, where the replacement of the inflected Pres” 
form of the ambitransitive predicate pisati 'to write' with the corre- 
spective Pres” form of napisati yields absolute ungrammaticality:* 


(1) Pišem (*Napisem ) sve ovo kao upozorenje novopostavljen- 
WRITE.1.sg.PresiPF > WRITE.1.sg.PresPF^ i . . 
im direktorima na moguca neprijatna iznenadenja. [SrpKor, viva0104_n.txt, Vi- 
va (April 2001)] 
‘Im writing all this to warn all the newly appointed directors of possible un- 
pleasant surprises". 


The Author wishes to express his gratitude to the editors of Balcania et Slavia, three 
anonymous reviewers of FASL 30 for their thought-provoking remarks which helped 
me greatly in reworking and fine-tuning the core of the original proposal, two anony- 
mous reviewers for their meticulous comments on an earlier draft of the paper, and all 
the Stokavian native speakers, members of the Facebook groups South Slavic Linguis- 
tics (https://www.facebook.com/groups/697148470712789) and Kako biste VI rekli? 
(https://www.facebook.com/groups/1649886005251662), who kindly agreed to fill out 
an online questionnaire. Special thanks go to James Hartstein for checking the correct- 
ness and the internal consistency of the text. The usual disclaimer applies. 


1 Throughoutthis article I will adopt the umbrella label BCS (shortly for Bosnian-Cro- 
atian-Serbian) in order to refer collectively to the different varieties of the South Slav- 
ic dialectal continuum which are nowadays spoken in most of the national republics 
that emerged after the collapse of Jugoslavia. BCS is to be taken here as a synonym of 
'Serbo-Croatian', which was the standard working term in linguistics at least up to the 
first major outbreak of Jugoslav Wars in 1991. The term however has been sidelined in 
this paper to prevent the emergence of unwarranted political associations. The Author 
of the present study acknowledges the scientific validity of the views stated in the Dec- 
laration on the Common Language (Deklaracija o zajednickom jeziku) and supports the 
general claim that Bosnian, Croatian, Serbian, and Montenegrin should be viewed as 
national variants (each with equal rights) of the same standard South Slavic polycentric 
language (for more technical evidence in support of this claim see also Bailyn 2010). The 
text ofthe Declaration is available at the following link: https://jezicinacionalizmi. 
com/deklaracija. Where not specified otherwise, all translations are by the Author. 


2 Eventuality is here used (in the original spirit of Bach 1986) as a cover term for both 
states and non-states (processes, events), while the definition of a main clause (i.e. a 
clause which can stand on its own, in opposition both to matrix and subordinate claus- 
es) is given after Haspelmath (2020, 603). 


3 Inlinguistic theory the concept of grammaticality has proved to be notoriously hard 
to disentangle from the (apparently overlapping) notion of acceptability. For the sake 
of clarity we stick to the results of the discussion recently elaborated in Leivada, West- 
ergaard 2020 and Leivada, Murphy 2021, 4-5, where it is claimed that (un)grammat- 
icality, unlike (un)acceptability, is more of a twofold concept (on the axis ‘absolute’- 
‘relative’) than a scalar notion. 
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This constraint has drawn considerable attention over recent dec- 
ades. In the cognitive literature the functional markedness of gram- 
matical forms with present time reference - resulting from the com- 
bination of the aspectual category 'perfective' and the temporal 
category 'present' - has been labelled as the present perfective par- 
adox (cf. among others De Wit 2017). It has been proposed that such 
markedness holds cross-linguistically, and is mainly triggered by the 
cognitive difficulty of processing and representing as a single whole 
an eventuality token still unfolding at the UT.* In other words, ei- 
ther the eventuality is predominantly represented as holding at the 
UT (thus focusing on its outer temporal anchoring), or is seen as a 
single whole (thus zooming in on its inner temporal properties). In 
more formal terms, adopting a Kratzerian definition of PF according 
to which the event time interval needs to be included within the ref- 
erence time interval (i.e. for present-tensed eventualities, the near- 
instantaneous UT), Todorović (2015, 87-8) argues that the impossi- 
bility of satisfying the inclusion requirement of PF within the time 
interval selected by the UT in SpecT leads to the impossibility for the 
event to be temporally located, and consequently triggers the abso- 
lute ungrammaticality of main clause Pres» forms.5 

De Wit (2017, 38) claims that languages can solve the present per- 
fective paradox resorting to three main repair strategies, i.e. assign- 
ing a futural (the prospective strategy), past (the retrospective strat- 
egy), or non-actual reading (the structural strategy) to eventualities 
realised with Pres” forms. Unlike East Slavic languages, where main 
clause Pres" forms have been prototypically reanalysed as aspectual 
futures (e.g. Russian procitaju,.,,, ., preser 1 Will read’), or even West 
Slavic languages, where main clause Pres» forms can be alternatively 
used either as aspectual futures or modally nuanced non-actual pre- 
sents (e.g. Czech koupim,,,,1.sg.Presrr ‘I will buy’, ‘I buy’), in almost all 
the South Slavic languages spoken in the Balkans* Pres» forms have 
retained only their non-actual function, and thus are unable to refer 
to eventuality tokens which are anchored to the UT. 

Old- and Neo-Stokavian dialects of BCS seemingly adopt a rein- 
forced structural strategy’ as the licensing of non-actual Pres forms 


4 The Utterance Time (UT) is here identified with the so-called reference time inter- 
val which, according to the proposal of Demirdache and Uribe-Extebarria (2004), oc- 
cupies a structural position in SpecT. 


5 Note that this formalisation, despite managing to nicely capture the BCS pattern, 
is not unproblematic per se, for it requires several successive adjustments in order to 
hold true from a cross-linguistic perspective. 


6 Exceptions are made for some Kajkavian and Cakavian dialects of the Croatian va- 
riety of BCS, which will not be taken into account in this survey. 


7 Inthis respect BCS imposes even more restrictions on the licensing of Pres* forms 
than other South Slavic languages (e.g. Bulgarian) and behaves more similarly to oth- 
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is restricted to embedded clauses (cf. (2), whereby procitam,,,, , 59. 
pes, I reaď is licensed inside a clause of purpose introduced by 
the complementizer da),? interrogative-negative contexts? (cf. ne 
dodete 'you don't come' and ne probate 'you don't 


COME.2.pl.PresPF TRY.2.pl.PresPF 


try' featured in the whimperative structure in (3)) or even chains of 


er languages of the Balkansprachbund. In Modern Greek, for instance, present tense 
forms built from the aoristic stem (synoptikoi 'momentaneous') cannot occur in main 
clause contexts and can be licensed either in embedded environments introduced by 
the subjunctive marker na (e.g. Mporó na alláx,,, ss 1.sg.Prosoe LS glosses óses forés thélo 
' can change languages as many times as I want’) or, as emphasised by an anonymous 
reviewer, as complements of the future marker tha (e.g. Tha to PIS. > sq Proser ‘You will 
say it’; cf. Giannakidou 2009). This last use bears some resemblance to the distribu- 
tion of Pres? forms in the Serbian variety of BCS, which can replace the bare infinitive 
and be licensed by the complementizer da after an inflected proclitic form of the (vo- 
litional) future marker (e.g. On ce [da dode] sour 3 ., proser He will come’ instead of On ce 
OCI oue inter)" The possible reasons for the difference between BCS and Modern Greek 
on one hand, the other South Slavic languages on the other are left for future research. 


8 It must be added, however, that the licensing of Pres" forms in BCS subordinate 
clauses is subject to further restrictions. Todorović (2017, 88-90) points out that, on 
the one hand, Pres” forms cannot occur in propositional complements of intensional 
verbs such as v(j)erovati ‘to believe’, at least when their temporal argument is not or- 
dered according to a relation of anteriority (<) or posteriority (>) with respect to the UT, 
hence the ungrammaticality of sentences such as *V(j)eruje da Jovan prevede, sus a sg. 
pres; P(j)esmu (intended: ‘(S)He believes that Jovan has translated a poem (just now)’). 
In addition, as a result of a more general constraint on the distribution of PF (which 
possibly holds cross-linguistically), in the Serbian variety of BCS Pres forms cannot 
be licensed as complements of phasal verbs such as počinjati ‘to begin’ (*Pocinjem da 
prevedem., c1 sy press; PESMU, intended: ‘I’m beginning to translate the entire po- 
em (just now)’). Generally speaking, as pointed out by an anonymous reviewer, pred- 
icates that select either subjunctive or infinitive forms in Romance languages can li- 
cense Pres” forms in BCS. This is, for instance, the case of verbs such as pokušavati 
'to try', whose tenseless complements do not yield a future interpretation whatsoever 
(*Pokusava da sutra Drevede,, suis 3 sg Prosrr pesmu, intended: '(S)He is trying to trans- 
late the entire poem tomorrow’) and can always enter an aspectual opposition with the 
corresponding Pres! forms. On the other hand, predicates selecting indicative forms 
in Romance languages block the licensing of Pres* forms in BCS. Among them are as- 
sertive predicates such as tvrditi 'claim', e.g. *Tvrdim da Ivan dode, (intend- 
ed: ‘I claim/am claiming that Ivan is coming’). 


COME.1.sg.PresPF 


9 Pres forms can appear in other nonveridical or anti-veridical contexts as well. With 
a small group of telic eventualities (e.g. dati ‘to give’, pustiti ‘to let’, dozvoliti ‘to allow’, 
doći ‘to come’), negative Pres forms are regularly licensed. These forms, which fall 
outside the scope of this paper, seemingly behave like light verbs and are prototypi- 
cally marked with 1st (singular and plural) or 2nd (singular and plural) bound person- 
number indexes, as in (i): 


(i) E, vidite, ja ne dam.ve1<gpreser taj trenutak, tu epohu, tu kristalnu kocku vedrine, 
“svečanost tragične vedrine”, ne dam. sg preset da to nestane. [HNK, vj20021110ku03] 
‘So, you see, | won't give away that moment, that age, that “crystal cube of serenity” 
[Kristalna kocka vedrine - a house of culture in Sisak], the “celebration of a tragic sereni- 
ty” [a theatre play by Tomislav Durbesic], | won't let it disappear’. 


Vojvodic’s (2015, 55-6) hypothesis that BCS elliptical constructions such as ne dam are 
derived via transformation of future-oriented negative volitional sentences of the type 
Neću [da dam] (lit. ‘I won't give (it)’) requires a more thorough examination. 
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habitual eventualities which however need to be explicitly marked as 
such"? (in (4) the non-actuality of the eventualities realised with the 
Pres” forms POPUEM rnk.1.sg.Prespr 1 drink' and ISCUGOI. rca cc prosit 
‘I squeeze' is signalled by the complex temporal adjunct svaki dan 
<nakon jela> ‘every day «after meals>’ and by the conjoined adjunct 
ujutro i navecer ‘in the morning and in the evening’):** 


10 Main clause (Stokavian) Pres forms can also be licensed in isolation, as long as 
the temporal anchoring of the clause remains unordered with reference to the UT. Un- 
like West and East Slavic languages, however, in contemporary BCS modal (dynamic) 
readings of Pres! have been - to the best of my knowledge - replaced entirely either by 
overt modal constructions governed by moći ‘can’ or by (stylistically unmarked) Pre- 
s»! forms, cf. (ii): 

(ii)? On u jednoj ruci digne (moze [da digne] / moze dici / diže 


LIFT.3.sg.PresPF urr.Infer 


precios) 25 kg. (Galton 1976, 92) 
‘He can lift 25 kg with one arm’. 


LIFT.3.sg.PrespF LIFT.3.sg. 


Other similar contexts include popular sayings (cf. (iii)) and semi-lexicalised expressions 
(cf. (iv)). Interestingly, in both cases 3rd p. sg. Pres» forms can freely alternate with Aor” 
forms, which in some cases are homographs and can thus be distinguished only prosod- 
ically (cf. d6de,mr.3.sg.presee He/She Comes’ vs. dóde sovr 3 sq Acree He/She came’). Note also 
that Aor™ forms in the given contexts do not trigger any anterior reading whatsoever: 


(iii) Ko se dima ne nadimi sox UP.3.sg.PresPr/3.sg.Aorpr? taj se vatre ne nagryje,,,, — | nagre- 
Ja; Sg.AorPF* 

*Results demand sacrifice (lit. He who does not catch the smoke catches no heat from 
the fire’). 


(iv) On mi dode,,,., .. prespr brat. (Dickey 2000, 199) 
‘He is my brother (lit. He comes to me as a brother"). 


Additionally, in several Kajkavian and Cakavian dialects Pres" forms can also be rein- 
terpreted as aspectual futures, most frequently alongside a temporal operator such as 
sutra ‘tomorrow’: compare sutra dodem,ur.1.sg.Presee "Tomorrow I'll come’, sutra ti javim- 
retknow.1.sg.Presee 1 l let you know tomorrow’, sutra naprintam,,.., .. ,,,,, 1 Il print tomor- 
row' and the like (I am thankful to Mladen Uhlik and an anonymous reviewer for pro- 
viding me with these as well as similar other examples). As for Stokavian areal varia- 
tion, Polovina (1985, 98-9) illustrates an interesting spoken exchange between two Bel- 
grade-based youngsters around mid-eighties where an apparently independent Pres? 
form seems to be freely licensed in a future-oriented context (Dobro, izađeš oue SA 
Prese U 7 kazes imam autobus u 7.30... ‘Fine, you come out at 7 o'clock, you say I've got 
my bus at 7.30...’). However, a closer look at the communicative situation reveals that 
the speaker is revisiting a potential future schedule for their interlocutor, in a way simi- 
lar to the so-called narrative future (Dickey 2000, 149-54), which in turn is usually con- 
sidered a peculiar instantiation of the historical present template. Since perfective as- 
pectual futures are virtually ungrammatical in Stokavian dialects, then, a comprehen- 
sive analysis of these contexts falls outside the scope of the present paper. 


11 Itshouldbe kept in mind that in both interrogative-negative and explicitly habitu- 
al contexts, Pres! forms (dolaziš, pijem, cijedim) can always occur. Unlike Pres" forms 
however, the use of Pres» forms in examples like (3) is linked to the emergence of par- 
ticular pragmatic inferences which have to do with the contextual actualisation either 
of circumstantial or dynamic modal meanings (among others Tanasić 1996, 169-70), 
while in examples like (4) the use of Pres” forms is generally preferred with telic even- 
tualities which are measured out by their (referential) internal argument (among oth- 
ers Dickey 2000, 68-71; Vojvodić 2019, 17). 
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(2) Kada sam stigao kući, seo sam da procitam,... | prese nekoliko radova koje su 
naši mali domaćini napisali na Casu veronauke povodom Vaskrsa. [SrpKor, po- 
li100402.txt, Politika (02.04.2010.)] 

‘Once I came back home, I sat and read some works which were written by our 
little hosts during their religion hour for Easter’. 


(3) Mnogo prirodnih ledenih zidova svih stepena težine Vas očekuje u i oko Livinja. 
Zašto ne dodete..ue 2 o prese i probate...» presos? [srWaC.4276.7, #4275] 


‘Lots of ice walls with all degrees of difficulty await you in and around Livig- 
no. Why don't you come and try yourself?’ (> ‘Come and try! You are invited!’). 


(4) “Svaki dan nakon jela popijem,.. 1 sg pres, T€ dO Tomislav pivo, a ujutro i navecer 
čašicu žestokog pelinkovca. U kavu obavezno iscijedim.,..., sg preser pola limuna, 
i doktor, vjerujte, meni ne treba [...]” [HNK, gs20040327sb29520] 
‘Every day after meals I drink a bottle of Tomislav beer, while in the morning and 
in the evening [I drink] a shot of Pelinkovac. In my coffee I squeeze half a lemon 
by default, and believe me, I don’t need any doctor’. 


However there seem to be three apparent exceptions to this high- 
er-level constraint applied to present-tensed main clauses. The first 
one involves the so-called abusive metonymic performatives (Dick- 
ey 2015), as shown in (5) below: 


(5) Da prevedem na srpsko-hrvatski-bošnjački: poserem se 
u državi u kojoj ste vi ministar. (Dickey 2015, 254) 
‘Let me put it in Serbo-Croato-Bosniak: | shit on my life in a state in which you 
are a minister". 


na svoj Zivot 


SHIT.1.sg.PresPF 


The second one involves the aspectual marking of a special subtype 
of running commentaries, live demonstrations, such as in the recipe 
instructions reported in (6): 


(6) Zapalim,,, ON FIRE.1.sg.PresPF joj kozu na vratu, pa preklopimo.» .sg.Prespr na leđa i 
u&vrstim,,... | sg Pres CACKalicama. Potom grabilicu napunim rakijom 

lozovačom, zagrijem peu T, zapalim 

puricu. [HNK, vj20091226pis010] 

‘I sear the skin [of the turkey] on the neck, then I fold it on the back and I fasten 

it with toothpicks. Then Ifill the ladle with lozovača [home-made grape rakija], 


warm it up and set it on fire, then I pour it on the turkey’. 


FILL.1.sg.PresPF 


SET ON FIRE.1.sg.PresPF? te polijemo ur. sg Presee 


The third one involves the possible licensing of Pres” forms in a par- 
ticular type of nonveridical context featuring the epistemic operator 
možda ‘maybe’, as in (7): 


(7) Nakonteniske karijere, posvetit ću se humanitarnom radu, a možda nađem nose. 
preser | Ženu s kojom ću zasnovati obitelj, kazao je 31-godišnji Chang. [HNK, 
vj20030202sp08] 
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‘After my tennis career is over l’Il devote myself to voluntary work, and maybe 
find a woman can start a family with, said the 31-year-old Chang’. 


In this paper it is claimed that these three contexts of Pres* use 
do not constitute an exception to the pattern we regularly find in 
Stokavian dialects. On the contrary, they can be subsumed under 
a unified analysis which, building on some theoretical tenets advo- 
cated by the so-called neoperformative hypotheses (among others 
Speas, Tenny 2003), aims at mapping the feature of (cognitive) con- 
trol in the higher portion of the clausal spine, above CP. The article 
is structured as follows: in 8 2 I will discuss the aforementioned 'ex- 
ceptions' in more detail with regard to the suppression of independ- 
ent Pres» forms from Stokavian main clauses; in § 3 I will reframe 
the issue at the syntax-pragmatics interface, providing some evidence 
for a proper syntacticization of the parameter of (cognitive) control 
inside SpeechActP; in 84 I will draw some preliminary conclusions. 


2 Breaking the Law? Anomalous Pres" Forms in Main 
Clauses 


In this section I will address each of the three case studies of this 
paper: abusive metonymic performatives (8 2.1), live demonstrations 
(8 2.2), and nonveridical contexts featuring možda (8 2.3). 


2.1 Abusive Metonymic Performatives 


The label abusive metonymic performatives (henceforth AMP; the 
term was first coined in Dickey 2015, 254-9; 268-70) applies to a 
closed class of obscene and disparaging phrases which, although 
lacking most of the characteristics ascribable to explicit performa- 
tive utterances,” share some interesting illocutionary similarities 


12 By explicit performative utterance we define a token of the utterance set U(x), ac- 
tualising a proposition P(x) with an explicit performative verb in its abstract logical 
form, which amounts to the action expressed in P(x) when uttered under felicitous cir- 
cumstances by an agent who is endowed with (or granted) the necessary authority. An 
explicit performative utterance has to be self-referential, pragmatically felicitous and 
grammatically well-formed; moreover, it must not satisfy any inherently antiperform- 
ative condition. Finally, it has to be categorically defined by the speaker in the speech 
act event and recognised as such by the hearer (Eckardt 2012, 47-8). 

Unlike AMPs, the prototypical tempo-aspectual form licensed in explicit performa- 
tive utterances (cf. the commissive illocutionary act exemplified by obecavati« ‘to prom- 
ise’ in (v)) is Pres", while Pres forms are regularly ruled out. It should be mentioned, 
however, that Pres» forms can be opposed by Fut», although only to a limited extent 
and as long as certain preconditions at the syntax-pragmatic interface (which cannot 
be explored in detail here) are met: 
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with them as well. In other words, AMPs, which refer to purely re- 
sultative eventualities and are typically a main clause phenomenon, 
do not constitute a simple depiction of the eventuality token actual- 
ised in the sentence; they also figuratively perform on the insulted 
hearer the action that the utterance refers to (hence the term 'met- 
onymic’). Relevant examples of Stokavian AMPs are reported in (5) 
above and (8) below: 


(8) Nabijem... s5 presee te na kurac! (Dickey 2000, 200) 
‘I thrust you onto my cock!’ 


Although AMPs are coded with Pres» forms in other Slavic languag- 
es, most notably those of the Western branch (which is in line with 
the Slavic East-West aspect division laid out in Dickey 2000), such 
marking stands out as peculiar in BCS. Comparing this usage with 
some exceptional cases of Pres” forms used performatively - both in 
Old Church Slavonic and Old Slovene - Dickey (2015, 268-70) claims 
that perfective AMPs should be considered a grammatical archaism 
inherited from Common Slavic, and credits ‘absolute control of the 
speaker’ as a key factor for their aspectual marking. 

Even though in Dickey (2015) the working definition of ‘absolute 
control of the speaker’ is not formalised, I have argued elsewhere 
(Biasio, forthcoming) that the author’s claim is sound, and that there 
is both historical, formal (i.e. morphosyntactic), as well as even evo- 
lutionary evidence to back it up.” It is also worth mentioning that 
AMPs can neither be coded as Pres" in their derogatory, performa- 
tive-like function (*Nabijam, cx 1.sg.presiee te na kurac could only report 
an ongoing event, if any), nor can they be marked with person-num- 
ber bound indexes different from Ist sg. (*Nabijes...... > sq preser ga NA 
kurac!) or negated (*Ne nabijem..... , << prespr t€ na kurac!). Additional- 
ly, other outrageous expressions that do not denote purely resultative 
eventualities resist marking with Pres? forms: cf. Jebem,....  .. veas. 
(*Odjebem ucr OFF.1.sg.PresPF / *ZajebeMg new nil ti mater ‘Fuck you, 
lit. ‘I fuck (*fuck off / *screw up) your mother’. The data seemingly 
suggests that the parameter of control may play a pivotal role in the 
aspectual coding of resultative eventualities in egophoric utterances. 


(v) Navijacima obecavam,owisei (*obecam /'éu obećati )mak- 
.sg.PresiPF promise.1.sg.Prespe! * PROMISE. 1.sg.FutPF 

simalno zalaganje, borbu za napad i svaku odbranu i to ce, valjda, doneti Zeljeni rezultat. 

[SrpKor, poli010303.txt, Politika (03 March 2001)] 

‘I promise our supporters absolute commitment and a relentless fight for attack and de- 

fence and this, | hope, will bring about the desired outcome’. 


13 Although I believe it is safer to assume that AMPs and other ‘perfective’ perform- 
atives in Old Church Slavonic and Old Slovene actually belonged to a general aspect 
(Biasio, forthcoming), in contemporary BCS such preverbed predicates are unambig- 
uously interpreted as PF. 
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2.2 Live Demonstrations 


There is a large body of literature concerning the aspectual mark- 
ing of running commentaries in Slavic languages (cf. among others 
Galton 1976, 95-7; Dickey 2000, 155-74), which are usually analysed 
as a peculiar instantiation of the historical present - irrespective of 
the significant variation they display both in their temporal anchor- 
ing and their extradiegetic properties. Instead, for the sake of clar- 
ity, in this subsection only a particular subset of running commen- 
taries is taken into account, i.e. live demonstrations (experiments, 
recipes etc.). As Dickey (2000, 161) appropriately points out, live dem- 
onstrations are interesting because of their complex temporal na- 
ture. Phrased otherwise, they are at the same time both (non-actu- 
al) generalisations over a definite set of propositions (possibly built 
upon a deficient anankastic-oriented model of the type 'if you want 
to X, you have to Y’) and the report of their ongoing actualisation in 
the utterance (i.e. the eventualities are actually unfolding at the UT). 

As shown in (6) above and again in (9) below, Pres* forms can be 
freely licensed in the given context. They can be contextually re- 
placed by PF imperative or infinitive forms (which however constitute 
different types of speech acts) or even alternate with Pres" forms, 
especially with unbounded eventualities. Person-number bound ver- 
balindexes may vary, mostly (but not exclusively) along the conjuga- 
tion axis 1st p. sg./pl.:** 


(9) Pecnicu zagrijemo,,. yo.1.p1.preser NA 250 stepeni i pecemo,,,. , pl pres, lepinje na dru- 
krpom i ostaviti dese- 


goj od vrha mrezi [...] Pecene lepinje pokriti |... | eee 

tak minuta da se ohlade prije rezanja. [sic!] (https: //www.coolinarika.com/ 
recept/bosanske-lepinje-f9c5e5f6-63db-11eb-8e23-0242ac120027) 
‘We preheat the oven to 250° and bake our buns on the rack (not the one on the 
top) [...] Coverthe baked buns with a cloth and letthem cool down for some ten 
minutes before cutting’. 


14 Variation in number (e.g. from singular to plural) is likely to be triggered by a dif- 
ferent structural configuration of discourse roles, which can be brought about in turn 
by a variation in the intensity of the illocutionary force assigned to the utterance. For 
instance, in her analysis of Russian 'mental performatives' (i.e. a subclass of exposi- 
tives which are frequently used to mold the rhetorical structure of written texts and 
public speeches in order to make them logically consistent, e.g. opredelit’* ‘to iden- 
tify’, predpoloZit^* ‘to suppose’, podcerknut’ ‘to emphasise’ and the like), Rjabceva 
(1992, 24) argues that the use of the 1st p. pl. of the perfective non-past (opredelim..., 
predpolozim..., podéerknem) is instrumental in underlining the impositive illocution- 
ary character of such predicates, in that the speaker, while unfolding their reasoning 
in a step-by-step fashion, expects their addressee(s) to follow the same logical steps 
and infer the same conclusion(s). 
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Dickey (2000, 168) further notes that in BCS this kind of PF-marked 
live demonstration cannot be overtly anchored to the UT by means 
of temporal operators such as sad(a) 'now' (here we gloss over some 
technicalities on the multilayered semantic nature of such deictic el- 
ements). For Pres forms to be licensed, reference needs to be made 
instead to the steps of the whole procedure as it is performed by the 
speaker. Moreover, if the eventualities are not presented as bound- 
ed, Pres forms can replace Pres, cf. (10): 


(10) Najpre UZmem.,.. 1 se Preser (uzimam,,.. | sp Presier) 
preser SadrZinu u retortu; onda upalim 
lampu i zagrejem,,., UP.l.sg.PresPF 
(Dickey 2000, 168) 

‘First | take the flask of sodium nitrate and pour the contents into this beaker; 
then | light the Bunsen burner and heat it to a boil...’ 


bocu sodium nitrata i sipam 


POUR.1.sg. 


LIGHT.1.sg.PresPF (DALIT esra sg presi) Bansenovu 
(zagrejavam ) do tačke ključanja... 


WARM UP.1.sg.PresiPF 


Dickey (2000, 173) goes on to suggest that the ban on the contex- 
tual presence of sad(a) alongside PF-marked live demonstrations is 
linked to the reportive character of the speaker’s speech, who would 
be in control of the flow of the events as they present them to their 
addressee(s) who are already ordered in a causal chain. Again, then, 
the aspectual coding of externally and/or internally bounded eventu- 
alities in semi-actual contexts seems to be dependent on the role of the 
point of view (POV) of a sentient individual in egophoric utterances. 


2.3 Contexts with možda ‘Maybe’ 


Pres» forms are regularly found in present-tensed main clause envi- 
ronments featuring the epistemic operator možda ‘maybe’, as in (7) 
above or (11) below. Variation can occur with respect to aspectual 
marking (unbounded eventualities, or presented as such, can be re- 
alized with Pres: forms) and person-number bound verbal indexes 
other than 1st p. (sg./pl.), as shown in (12): 


(11) Kad je mogao Dule, zašto ne bismo mi koristili taj recpet [sic/]. Možda 
pozovem,... «i sg presse VUÎOSEVICA da provededvamesecasnama.[srWaC.883681.21, 
#883680] 

‘If Dule could, why wouldn't we use this recipe? Maybe I can invite Vujošević to 
spend two months with us’. 


(12 


Prvo je i osnovno pravilo da sebe učiniš sreénom osobom, da budeš srećan 
čovek. Onda imaš šanse i nekog drugog da usrećiš, možda NapraviŠ sae. » presos È 
odgajlS, ... » .« Presi neku sreénu decu. [srWaC.1189461.10, #1189460] 

*The first and most basic rule is to make yourself a happy person, to be happy. 
Then you can have chances to make someone else happy, maybe you can make 


and raise some happy children’. 


56 


Balcania et Slavia e-ISSN 2785-3187 
1, 1, 2021, 47-66 


Marco Biasio 
It's All Under Control! On Perfective Present Forms in BCS Main Clauses 


This type of Pres* use, which is sometimes mentioned in passing even 
in prescriptive grammars (cf. Barić et al. 1997, 409), is worthy of at- 
tention as, unlike AMPs and running commentaries, it involves pure- 
ly nonveridical contexts, seemingly aligning itself with interrogative- 
negative structures (cf. (3)) and negative presents (cf. (i)). Should this 
pattern be consistent, we would expect Pres* forms to be licensed 
even alongside other epistemic operators semantically akin to mozda, 
such as verovatno,,/ vjerojatno,,, ‘probably’ or moguće ‘possibly’, but 
this is not the case as (13) clearly shows: 


(13) -Za sledeći put, i neki vikend... - Da, možda (*verovatno / *moguce) dodem 
pres» NA vikend ovde, da. [srWaC.1128436.4, #1128435] 
*- Next time, maybe some weekend... - Yes, maybe I'll come here for a week- 
end, sure’. 


COME.1.sg. 


On the basis of (surface) syntactic and prosodic evidence, one could be 
inclined to look at mòZda as the result of the univerbation of the epis- 
temic modal móZe,,, , <4 presi (from mdci ‘can’) and a ‘hidden’ comple- 
mentizer da, which, following Veselinović (2019, 46-58), occupies a po- 
sition in SpecC and takes a clausal complement (much in the spirit of 
the ‘declarative’ da proposed in Vrzié 1996).!5 While this may very well 
account for the diachronic derivation of the operator, it fails to provide 
a Satisfactory structural explanation as it does not provide reasons for 
the general acceptability of (14a) and (15),!5 nor does it clarify why the 
same pattern does not hold for valjda ‘probably’, ‘I guess’ (cf. (16)-(18)).!” 


15 Note, however, that there is no general agreement on the structure of the clausal 
architecture in BCS and that alternative hypotheses involving different makeups of the 
BCS clausal spine have been put forth as well (cf., for instance, Todorovic 2012, where 
it is proposed that the two da in BCS are not complementizers senso strictu, but rath- 
er operators introducing either veridical or nonveridical complements). Note also that 
Kovačević (2008, 197-202) points out that možda alongside Pres can license Fut" and 
Fut forms as well (Možda kupim,,., .. ,,,,, / CU kupiti,.., so.Futpr / budem kupio piya og Futt- 
prinasa. a Ptr kuću na moru). A further analysis of the range of variations with- 
in the pattern, particularly in relation to the licensing properties of valjda, is left for 
future research. 


16 According to my consultants, both (14a) and (15) are quite marginal in literary 
Stokavian and occur with a relative frequency in oral speech (according to an anony- 
mous reviewer, (14a) occurs most naturally as an elicited answer from a prior context). 
The majority of the Stokavian native speakers I consulted claimed that (14a) can be bet- 
ter parsed as an elicited answer to a previous context, although there is plenty of evi- 
dence for such contexts appearing in isolation as well, cf. (vi) below: 


(vi) Možda da jumbo-plakate dignemo,,.st.1.pipresse i kat više, pa uz onu narodnu “dale- 
ko od očiju, daleko od srca”, uživamo u nasmijeSenim modelima s reklama. [HNK, 
gs200307010s21879] 

‘Maybe we should raise the billboards one floor up and then, in accordance with the old 
saying “out of sight, out of mind”, we enjoy these smiling models from the advertisements’ 


17 According to my consultants, (16) can be uttered either if kupim?'*** is replaced 
by kupujem*'*** or if a relative temporal adverb such as dotad ‘by then’ is added to the 
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Indeed, valjda displays the same prosodical contour as mòzda and can 
be diachronically derived in a similar manner (i.e. via univerbation of 
valja from valjati ‘to be worth’ and da): 


3.sg.Pres.IPF 


(14) a. Možda da kupim"'**** kuću na moru. (Intended: ‘Perhaps I should buy a beach 
house’) 
b. *Moze da da kupim"'*** kuću na moru. 


(15) Možda može da kupim"'**** kuću na moru. 

(16) ""Valjda kupim"'**** kuću na moru. (Intended: ‘I guess | should buy a beach house") 
(17) ""Valjda da kupim*'*** kuću na moru. 

(18) ""Valjda valja da kupim"'*** kuću na moru. 

Interestingly, Miric (2004, 218) points out that in the Serbian vari- 
ant of the BCS continuum both možda and verovatno can be used if 
the realization of the eventuality actualized in the utterance rests 
entirely on the speaker, while valjda implies that such realization is 


not (completely) under the speaker's control: 


(19) - Hoćeš li ti doći večeras? - Možda. / - Verovatno. / - *Valjda.?? 
'- Are you coming this evening? - Maybe. / - Probably.’ 


context (the issue why a Pres" form here could be interpreted as a future perfect can- 
not be addressed here). (17) and (18) are acceptable only as interrogatives. 


18 Although both možda and valjda can be used either as full modal (epistemic-evi- 
dential) operators or parenthetical elements (Zvekic-Dusanovic 2019), the two (classes 
of) items are still assigned different intonational patterns. However, the difference in 
the epistemic commitment associated with the two operators (with valjda being epis- 
temically weaker and, possibly, doxastically stronger than mozda) does not seem to af- 
fect their syntactic properties. 


19 It goes without saying that in (19) valjda is not ungrammatical per se, however 
its distribution can only be generalised to contexts where the feature of the speak- 
er's control is not prominent. In order to better exemplify the contrast between oper- 
ators I resort to the following example (vii) (I thank an anonymous reviewer for sug- 
gesting it to me): 


(vii) Pavlovic se nada da nije kraj: Valjda cu odigrati još neki meč za Partizan. (http:// 
beta.admin.mozzartsport.com/fudbal/vesti/pavlovic-se-nada-da-nije- 
kraj-valjda-cu-odigrati-jos-neki-mec-za-partizan/354132) 

‘Pavlović hopes this is not the end: | will probably play some more games for Partizan’ (5 ‘I 
hope / suppose | will play some more games for Partizan, but it does not depend on me’). 


On a side note, možda can also stack together with valja, as in the following example (viii): 


(viii) Da, možda valja uključiti i moldavski i ukrajinski. (https://twitter.com/srdjan- 
drago/status/1290913720856698880) 
‘Yes, probably it is worth including both Moldavian and Ukrainian’. 
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This would suggest that the licenseability of Pres forms is not (ex- 
clusively) triggered by either the compositionally derived syntactic 
properties of možda (with reference to, for instance, vjerojatno,./ 
verovatno „and moguće) or by the nonveridical flavour of the context 
(cf. the contrast with valjda). Another aspect seems to bring možda- 
contexts together with the other case studies under analysis, i.e. the 
prominent salience of the speaker's control-related POV. In the next 
section I will elaborate on this idea and argue that control, beside 
its semantic contribution, is a syntactically relevant element as well. 


3 Controlas a Variable in SpeechActP 


The idea that linguistic components traditionally assigned to dis- 
course-related modules (e.g. the role of discourse participants in 
the speech act event, other logophoric roles, and the notion of POV) 
should be encoded in the syntactic structure dates back to the early 
seventies, and would be brought to light some decades later on the 
basis of consistent crosslinguistic evidence. Speas and Tenny (2003, 
320) were among the first to argue for the existence of a separate 
functional layer above CP, the so-called ‘SpeechActP’, where dis- 
course roles (traditionally, the Speaker and the Hearer) are mapped 
and defined in terms of their structural positions. 

In their joint paper, Speas and Tenny note that the restrictions on 
the inventory of phenomena linguistically dependent on the POV of 
a sentient individual tend to be crosslinguistically stable, which may 
hint at the hybrid (i.e. not entirely pragmatic) nature of the proper- 
ties of such phenomena. They thus propose a further configurational 
structure in the scope of SpeechActP, a projection called ‘Evaluation 
Phrase’, which takes two arguments: the Seat of Knowledge (a "sen- 
tient ‘mind,’ who can evaluate, or process, or comment on the truth 
of a proposition", p. 332, henceforth SOK), mapped to the specifier 
position, and the Evidential Phrase, which is linked to the evidential 
inference(s) on the truth of a given proposition and mapped to the in- 
ternal argument position (Speas, Tenny 2003, 327-31). 

Since Speas and Tenny (2003), there has been much speculation 
on the exact composition of the inner layer of SpeechActP, includ- 
ing the number of functional projections included within the maxi- 
mal projected structure and their respective ordering. In this paper 
I will adopt Zu's (2018, 73 ff., 101-2) slightly revised idea that the SOK 
appears in the specifier position of a Sentience Phrase and is bound 
by the base-generated Speaker in declaratives, rhetorical questions 
and quiz questions. Within this framework, I define cognitive con- 
trol as the hierarchical and contingent (i.e. not given a priori) rela- 
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tion between the Speaker and the Hearer in the speech act event,?? 
which can be modelled as an abstract variable specified at the level 
of the SOK, checked against by the Speaker (which may or may not 
be coindexed with the syntactic subject) and spelled out lower in the 
structure in AspP as PF. 

The idea that the cognitive feature of the speaker's control may 
additionally have a syntactic salience stems from the following two 
observations. The first one echoes the treatment of speech-style par- 
ticles and markers of politeness in Korean discussed in detail by 
Portner, Pak and Zanuttini (2019). Abstracting away from the (quite 
dense) details of their proposal, by replacing SpeechActP with anoth- 
er functional projection called 'cP' ('c' stands for ‘context’), the au- 
thors argue that different allocutive pronominal forms, syntactical- 
ly bound by the Hearer ('Interlocutor' in their terms), are spelled out 
via an operator-variable relationship mediated by a A-abstractor type 
that would reflect the alternation of abstract features on the function- 
al head c (Portner, Pak, Zanuttini 2019, 24-6). Judging from our da- 
ta sample, control is always related to the Speaker notwithstanding 
the presence of an actual coindexing of the discourse participant role 
with the syntactic subject (cf. above (12)). In other words, control al- 
ways seems to be tightly linked with the expression of the Speaker's 
POV in egophoric utterances. Following Zu's (2018) account, we can 
thus assume that in declarative contexts the SOK is always bound by 
the Speaker and that control, which can be overtly expressed when- 
ever a hierarchical relationship between (at least) two participants 
in the same speech act event is established, seemingly functions as 
a variable mediated by a A-operator at the level of the SOK, as tenta- 
tively sketched out in (20): 

(20) [ Speaker[ 


SOK[.. Acon]]]]] 


deal e: Sen 


The second observation boils down to the correlation between con- 
trol and perfective morphology. Dickey's (2000, 25-7) classical pro- 
posal of a Slavic East-West aspect division on the basis of the pro- 
totypical meaning assigned to PF places BCS in a transitional zone, 
with Stokavian varieties sharing more commonalities with the East- 
ern group. The crucial point here is that control can be seen as a 
contextually determined effect stemming from the aspectual seman- 
tic core of PF in Eastern Slavic languages, namely temporal definite- 
ness, i.e. the assignment of an eventuality to a uniquely determined 
point on the time axis, over which the speaker would then yield full 
control (Dickey 2018). If we assume that in Stokavian the variable 


20 Acomprehensive discussion of the pragmatic variables of the participant struc- 
ture lies beyond the scope of the present paper. 
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for control is specified at the level of the SOK and thus bound by the 
Speaker, it is possible to claim that the SOK enters an agreement re- 
lation with the aspectual projection at a distance (much like probe- 
goal agreement), consequently triggering the spell-out of the mor- 
phological element(s) associated with PF.” If this hypothesis is indeed 
on the right track, an example like (21a) would then have the struc- 
tural representation in (21b) (note that only the relevant projections 
of the clausal spine are featured below): 


(21) a. Možda kupim, <; preso kuću na moru. 
b. lspeschacep S PEaker L enplspecsen 2 OK [genACON LeplepeccMOZAL,..plspecaspPFlyplykupitil,, 


kuću na moru]]]]}}]}}}]]] 


4 Conclusions 


In this paper, I presented a unified analysis for several contexts fea- 
turing the (apparently) anomalous licensing of Pres» forms in BCS 
present-tensed main clauses, i.e. abusive metonymic performatives 
(8 2.1), live demonstrations (8 2.2), and nonveridical contexts with 
možda (8 2.3) inter alia. I claimed that for all these cases the feature 
of (cognitive) control, which can be defined in terms of the hierar- 
chical relation between the Speaker and the Hearer - and thus en- 
coded in the syntactic structure as an abstract variable at the level 
of the SOK in SpeechActP - bears special relevance. The association 
between control and perfective morphology can be modelled in terms 
of a probe-goal agreement between the SOK and the aspectual pro- 
jection and can be explained as a semantic extension of the proto- 
typical function assigned to PF in the Stokavian varieties of the BCS 
continuum, i.e. temporal definiteness. 

Regrettably, a number of pertinent questions could not be an- 
swered in this survey. For instance, it remains unclear why Pres* 
forms cannot be licensed in similar semi-actual contexts, whereby 
control is also shown to play a significant role, e.g. performative ut- 
terances (cf. (v)).? Furthermore, the role of verbal tenses (more spe- 


21 Itisimportant to mention that in BCS an eventuality can be realised as PF only if 
it is externally and/or internally bounded. Internal boundedness here corresponds to 
what is commonly labelled '(inherent) telicity' (resultativity). Externally bounded even- 
tualities do not need to be telic. 


22 It may be the case that, differently from AMPs, the temporal anchoring of per- 
formative utterances is assigned a different ordering with reference to the UT. It may 
also be the case that in regular performative utterances such as (v) the hierarchical 
relation between the speaker and the hearer is modelled in a slightly different fash- 
ion with reference to AMPs, with the speaker still yielding control over the eventuali- 
ty in their scope (domain) yet being unable to intrude in the hearer's domain and take 
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cifically, the aorist and the two future tenses) and their interaction 
with the higher layer of the syntactic structure should be taken in- 
to greater consideration. Finally, dialectal internal variation should 
be reviewed more thoroughly so as to reinforce the core principles 
of the main argument. All these issues and their interplay are thus 
worthy topics for future research. 


List of abbreviations 


Aor** perfective aorist 

fem feminine 

Fut’ perfective future 

Fut": perfective future perfect 
HNK Hrvatski nacionalni korpus 
HR Croatian 

Inf infinitive 

IPF imperfective 

masc masculine 

p person 

PF perfective 

pl plural 

POV point of view 

Pres** perfective present 
Pres!?F imperfective present 

sg singular 

SOK Seat of Knowledge 

SR Serbian 


SrpKor Korpus savremenoga srpskog jezika 
srWaC Serbian Web Corpus 
UT Utterance Time 


it over completely (Dickey, personal communication). I am grateful to Stephen Dickey 
for the fruitful discussion on these data. 
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1 Introduction 


Slavic clitics have been the focus of attention in a number of works 
in the recent decades (Franks, King 2000; Franks 2017; Zimmerling 
2013; Migdalski 2016; among many others), with accounts also on the 
diachrony of cliticization in (one or more) Slavic languages.* 

The diachrony of the Bulgarian clitics has drawn particular inter- 
est because Bulgarian is part of the Balkan Sprachbund and as such 
it exhibits verb-adjacent cliticization pattern, but also because it dis- 
plays various patterns in its history.? Although it has been claimed 
that the position of clitics in Old Church Slavonic (OCS) texts mostly 
echoes their placement in the Greek originals, there are numerous ex- 
ceptions reflecting patterns in the language of the scribes (especially 
in constructions without parallels in Greek; see Stawski 1946, among 
others). In her seminal study on the clitics in the history of Bulgar- 
ian, Pancheva (2005) argues that verb-adjacent clitics found in Old 
Bulgarian (and OCS) temporarily switched to second-position (up to 
and beyond the Middle Bulgarian period), only to become verb-adja- 
cent again (in modern Bulgarian). Migdalski (2016) claims that in the 
majority of cases pronominal clitics in OCS were verb-adjacent while 
the second position (2P) was obligatory only for the so-called 'oper- 
ator’ clitics (discourse markers and ‘ethical datives’); he relates the 
verb-adjacency to the presence of tense morphology in a language. 

The discussion in the present article is based on data excerpted 
from manuscripts that presumably reflect the diachrony of the Bul- 
garian language and are closer to the vernacular than to the literary 
language. The main aim of the article is to sketch the evolvement of 
the (sentential) clitic cluster in the history of Bulgarian and to check 


Part of the work presented in the article was performed within the Russian-Bulgarian 
bilateral project Evolution of the Grammatical Structure in Russian and Bulgarian Lan- 
guages in a Contrastive Context: Corpus Analysis and Formal Grammars. The Bulgari- 
an part has been financially supported by the Fund of Scientific Research at the Min- 
istry of Education and Science with a contract N02/2 from 14.06.2018. A version of the 
text was presented at the 25th annual conference Derzhavin Readings in March 2020, 
within the section "Contemporary and Historical Problems of Bulgarian and Slavic 
Studies" (with a short paper included in the proceedings). I would like to thank the two 
anonymous reviewers of the present article for the constructive comments, as well as 
Andrej Bojadziev for the discussion of some points and Iliyana Krapova for the exten- 
sive comments and suggestions. Needless to say, the Author is solely responsible for 
all the remaining errors. 


1 Slawski 1946; Radanovic-Kocié 1988; Pancheva 2005; Kosek 2011; Zimmerling 2013; 
Migdalski 2013; 2016. 

2 Stawski 1946; Pancheva 2005; Zimmerling 2013; Migdalski 2013; 2016. 

3 Here Iwill use the term Old Church Slavonic (OCS), which reflects the status of the 
language as used by the Slavic orthodox community (cf. Bujukliev et al. 1993). Howev- 
er, the linguistic features of the texts in the monuments are assumed to reflect those 
of Old Bulgarian. 
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whether there has been a significant change in the clustering pattern. 
The observations on this specific set of data confirm the old hypoth- 
esis that the clitics in Bulgarian are (and were) verb-adjacent (and 
also target 2P). The necessary ramification here is that any conclu- 
sions based on heterogeneous diachronic data highly depend on the 
sources and their analysis. The notion ‘second position’, for example, 
depends on syntactic constituency within the language and on the in- 
terpretation of the data that are taken into account. With diachron- 
ic texts, the segmentation is also a matter of interpretation: for the 
OCS data here, the texts were already segmented within the corpus 
used for reference; with the other data (Middle Bulgarian and Early 
Middle Bulgarian), however, segmentation was done aside. The dif- 
ficulties with the specifics of the data remain: with the OCS data I 
have tried to consider variant readings if accessible, with interest- 
ing results;^ this should be done over all the data if possible (for ex- 
ample, one should consider variant readings within the texts of the 
damaskin collections). In order for such a task to be accomplished, 
however, one needs a really big and smartly annotated corpus, which 
is unfortunately missing at this time. 


2  Cliticsin Bulgarian 


To define "what a clitic is' is not a trivial task because, as Franks 
(2017, 146) puts it, "the idiosyncrasies of clitics all go in the same 
direction, i.e., a negative one". It is easier to say what clitics are not 
rather than what they are, because they are - prosodically, morpho- 
logically, lexically, and syntactically - ‘deficient’. Firstly, clitics are 
prosodically deficient as they do not have an independent stress, but 
form a prosodic unit with another, prosodically ‘strong’, word. How- 
ever, in many languages clitics may host the stress when preceding 
or following other elements that remain prosodically ‘weak’ (or may 
receive secondary stress). In present-day standard Bulgarian, clit- 
ics may bear stress after negation as in (1) (there is also a hypothe- 
sis for a secondary stress here; Krásteva (2020, 119), on the basis of 
experimental data, claims that in interrogative sentences with ne- 
gation, even when the clitics receive stress, they do not have "an in- 
dependent intonational peak"). 


4 A more detailed review cannot be supplied here but ca. 4396 of the occurrences 
of the datives wu ‘I.DAT’ and Tu ‘you.DAT’ in Codex Marianus (CM) have variant read- 
ings in other New Testament (NT) texts as witnessed in Codex Zographensis (CZ), Co- 
dex Assemani (CA) and Codex Sabbae (CSb), and/or in the respective Greek editions. 
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(1) 

He. MY nane KHMrara, Hann? 

NEG' he.DAT gave book.the right 

* Thefollowing abbreviations are used in the glosses: NEG, negative particle; EMPH, 
emphatic particle, e.g. xe; NOM, nominative; GEN, genitive; DAT, dative; ACC, accusative; 
ReflcL, ca; ReflPosscL, CH; DEF, word form with a definite article; cop, copula; AUX, 
auxiliary; COND, conditional; INF, infinitive; IND, indicative; AOR, aorist; IMPF, imperfect; 
RENARR, renarrative; SUBJ, subjunctive. The elements at focus are given in bold. 


Secondly, clitics are a 'bag' of grammatical features (Sadock 1991) 
and do not have lexico-conceptual features, i.e. they are also semanti- 
cally deficient (Franks 2017, 154). Historically, the short dative forms 
of the first- and second-person singular pronouns mu ‘I.DAT’ and Tu 
'you.DAT' were already clitics in Old Bulgarian/OCS (as were the par- 
allel pronouns in Greek, see Wackernagel 2009; Vecerka 1989; 1993); 
other short pronominal forms, however, became clitics as a result of 
early attrition: 3pSg masculine accusative e-ro (1e-ro)? ‘he.Acc’ > ro 
‘he.acc’, with further reconsideration of r- as a third-person marker 
resulting in a generalised third-person plural with the ending -u (ru 
'they.ACC'); em/e > x > a ‘she.acc’ (3pSg feminine accusative); emoy 
> Moy 'he.DAT' (3pSg masculine and neuter dative), eu > eii > ii ‘she. 
DAT’ (3pSg feminine dative), etc.; see Byjukliev et al. 1991, 234-6; 
Mircev 1963, 165-6). 

Thirdly, clitics are syntactically dependent although different ele- 
ments may exhibit different behaviour. Clitics can be categorised in- 
to 'simple' (non-paradigmatic) and 'special' (paradigmatic) (Zwicky 
1977; 1985): in present-day Bulgarian, the latter are present-tense 
auxiliaries/copulas, short forms of personal and possessive pro- 
nouns, which have corresponding 'full' (prosodically and syntacti- 
cally ‘strong’) forms. 

Clitics are known for their ability to clusterise - in some languag- 
es the clitic cluster has a fixed position in the clause or in the phrase 
(in standard Bulgarian these are the ‘special’ clitics and the ‘sim- 
ple’ interrogative mu). The so-called Wackernagel clitics in Indo-Eu- 
ropean languages were all second-position (2P) clitics. At issue here 
would be the definition of the second position: whether the clitic is 
positioned after the first word in the sentence, as originally formu- 
lated by Wackernagel (2009), or after the first syntactic constituent 
(including a phrasal one) (Halpern 1995), or after the first constit- 
uent that immediately follows the so-called rhythmic-syntactic bar- 
rier of Zaliznjak (2008). Zaliznjak introduced this barrier as an ad- 


5 Orthographic variants throughout the text are generalised if possible (e.g. ero = 
erw; Mu = Mi etc.), except for certain examples. The same is true for jotated variants 
(i.e. ero is given instead of tro etc.). 
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ditional (and obligatory) condition to make the Wackernagel effect 
work for the Old Russian clitics. Additionally, the restrictions known 
as the Tobler-Mussafia's law (Tobler 1875; Mussafia 1888), original- 
ly formulated for the Romance languages, do not allow ‘special’ clit- 
ics to be first in the sentence. 

All these constraints are partially valid for present-day standard 
Bulgarian: clitics are mostly verb-adjacent, and are found either af- 
ter the verb in the first position in the clause, as in (2a), or before 
the verb if there is another constituent in first position, as in (2b). Al- 
though (2c) is ungrammatical with clitics following both the subject 
and the verb and coming third in the linear order of the clause, clit- 
ics can still be found further up in the clause following more than one 
constituent, as in (2d). Clitics may also land third after a preceding 
verb that, however, is analysed as first after the rhythmic-syntactic 
barrier (with other sentential element to its left), as in (2e). 


(2) a 

Aane My A VUBaH KHMrara 

gave he.DAT she.Acc Ivan book.DEF 

b. 

UBaH My A nane KHMrara 

Ivan he.DAT she.acc gave book.DEF 

c. 

*UBaH page My A KHMrara 

*lvan gave he.DAT she.Acc book.DEF 

d. 

B cb60Ta n3HeHagBawyo VBaH My A nane KHMraTa 
In Saturday surprisingly Ivan he.DAT she.acc gave book.DEF 
e. 

Da, nane My A KHMraTa MWBaH 

Yes, gave he.DAT she.Acc book.DEF Ivan 


According to Zimmerling (2013, 89), there are different language sys- 
tems regarding the position of the clitics, and Bulgarian belongs to 
the class of languages with so-called ‘extended WP+ system’, in which 
the clustering clitics are expected to be verb-adjacent or rather the 
verb in sentences with 2P clitics is expected to be clitic-adjacent, i.e. 
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the positional constraint here is both on the clitic(s) and on the verb.* 
In addition, the clustering clitics are in more or less fixed position 
relative to the clause boundary - generally second after an element 
in complementiser position (conjunctions, wh-words, other relativ- 
isers, etc.) or following the first constituent that comes immediate- 
ly after the so-called rhythmic-syntactic barrier of Zaliznjak (2008). 

The following considerations are applicable for the discussion that 
follows. In the article, I assume that second-position of the clitics is 
the position following: (a) a specific set of conjunctions (za ‘to’, ue 
‘that’, bko ‘as’, ame ‘if’, amu ‘but’); (b) wh-words; (c) other relativis- 
ers (rero, ne, moro ‘that, which’);’ (d) a constituent in focus or top- 
ic; or (e) verbal element in first position. Sentential constituents lo- 
cated outside the boundary ofthe clause such as vocatives and some 
extra-sentential particles do not count as first position. Different- 
ly from other Slavic languages, clitics in Bulgarian cannot be found 
first in the sentence, and in spite of some exceptions in historical 
texts, Bulgarian clitics in general comply with the Tobler-Mussafia's 
law if and only if the latter applies to the clause and not to the lev- 
el of the intonational phrase (i.e. relative to the rhythmic-syntactic 
barrier of Zaliznjak (2008). In what follows, we will see that these 
considerations are also valid for the history of Bulgarian in spite of 
some counter-examples. 


3 Clustering: The History 


The hypothesis probed in the article is that the order of the elements 
within the clusteris a follow-up on the diachronic development of ele- 
ments targeting the second position, from left to right. Thus, the old- 
est clitics (2P) would be on the far left (or first) of the clustering el- 
ements, while the newest ones would land on the (far) right. Beside 
the special clitics and the discourse clitics, elements that are found 
around the cluster and are considered (semi-)part of it by some au- 
thors (see more in Zaliznjak 2008), are: the interrogative clitic mu, 
which is found on the far left but also, given that it does not impose 
any syntactic restrictions on the preceding element, it may show up 
after each of the other elements in the cluster; the old adverbial par- 
ticles u ‘and’, roy ‘then’; and the adverbs nak, naks ‘again’. 


6 As one of the reviewers rightly put it “a decisive factor could be the position of the 
verb, and in that scenario the second position requirement is irrelevant”, although there 
are still some exceptions, though obscure or stylistically marked. 


7 These form prosodic words (linked by _) with the clitics in some texts - in fact, in 
these positions, the clitics are part of the first prosodic complex: DLv (17th c.; Mladen- 
ova, Veléeva 2013): &ero My ce náne BÓJA ÖHOMÝZH BJIÉKÝ. WOTO e Cera àHTHXpicTb. 
9 Jia C€ MOKNOHATS B/IÉKY, ue My^e pdOoTaTa KaTO Ha BJIÉKP. 
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Not all clustering elements in present-day Bulgarian show simi- 
lar behaviour. Monosyllabic auxiliaries, other than the present-tense 
ones, such as the conditional 6u ‘would’, the past-tense 6e ‘was/were’, 
the future me ‘will/shall’, may also appear first in the clause. Both in 
present-day standard Bulgarian and in the historical data observed 
in the article, these auxiliaries may be found either to the left or to 
the right of the pronominal elements (the short forms of dative and 
accusative personal pronouns, the short forms of possessive person- 
al pronouns, which have the same form as the dative personal ones, 
and the short reflexive pronouns - accusative ce and dative cu). With- 
in the cluster, the pronominal clitics appear in a fixed order with re- 
spect to one another: the dative always precedes the accusative (with 
reflexives behaving in a parallel way), while the present-tense aux- 
iliaries show a split behaviour: all persons except third person sin- 
gular show up to the left of the pronominals, while the third-person 
singular auxiliary (e'is.3pSg') shows up to its right. 

In the present article, I propose that the cluster is composed of a 
primary layer and a secondary one. The primary layer is further split 
into a core (pronominal) and a periphery (verbal). This is also histori- 
cally motivated as the auxiliary clitics have joined the clustering ele- 
ments later (present-tense auxiliary clitics are derived via attrition to 
the left orto the right of the remaining elements: first-person singular 
€-CMb > CbM 'am.1pSg'; second-person singular e-cu > cu 'are.2pSg;; 
third-person singular e-cT > e 'is.3pSg' third-person plural cx-r» > 
ca ‘are.3pPl’). The secondary layer hosts elements that are clitic-ad- 
jacent such as past-tense auxiliaries whose position also varies as in 
(3a) vs. (3b), as well as the conditional (see Nicolova 2008). 


(3) a 

VUBaH 6e/6ewe/6un My ro pan. 
Ivan be.AOR/IMPF/RENARR he.DAT he.Acc given 
b. 

VBaH My ro 6e/6ewe/6un pan. 
Ivan he.DAT he.Acc be.AOR/IMPF/RENARR given 


In the pattern in (4), the primary layer is in bold. Some dialects have 
retained only present-tense AUX2, i.e. the verbal clitic is to the right 
(Antonova-Vasileva et al. 2016, map 144). 


(4) (Qu) (AUX0) AUX1 DAT ACC AUX2 (AUX +) 


The ordering in (4) illustrates the overall pattern in present-day 
standard Bulgarian and if we are to judge by the data discussed be- 
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low, it has undergone relatively few changes, which have rather af- 
fected the inventory of the elements that may appear in the respec- 
tive slots rather than the pattern. 

The earliest 2P clitics in Old Bulgarian/OCS are the discourse 
markers xe and 60 (the latter, however, was lost early). The pronom- 
inal clitics, which presumably joined the cluster at a later point and 
which build the core of the primary layer also in the history of Bul- 
garian, can be classified into several subclasses depending on their 
placement within the cluster: (1) the dative first-person singular mu 
'L. DAT' and second-person singular Tu ‘you.DAT’; (2) the short forms of 
anaphoric pronouns (which would be later reanalysed as third-person 
pronouns): n ‘he.acc’ (masculine singular; neuter and feminine dual), 
x 'she.ACC' (feminine singular), e ‘it.acc’ (neuter singular), A 'they.ACC' 
(plural, masculine and feminine), 5/ra ‘they.DU’ (neuter plural; mascu- 
line dual); the accusative short forms of the first- and second-person 
singular personal pronouns MA 'I.ACC' and TA 'you.ACC' (and, possibly, 
plurals and duals), defined as semi-clitics (Vaillant 1948, 262; Večerka 
1989, 42) as they may occur in 'strong' positions - after a preposition 
and (rarely) at the beginning of a sentence; and reflexives cA (accu- 
sative) and cu (dative), also found after prepositions; (3) in later texts, 
the inventory expands with third-person singular datives Moy 'he.DAT' 
(masculine and neuter) and zi 'she.DAT' (feminine) derived from the 
earlier dative short forms emoy and ex that also tend to appear more 
or less adjacent to the position of the cluster in earlier texts. 

Both historically and in present-day Bulgarian, verbal elements 
('connectors'; Zimmerling 2013) among the clustering elements are 
located in the periphery of the primary layer and display variation in 
placement with respect to other clustering elements in that they can 
occupy either AUX1 or AUX2 in (5) below. These are: the monosyllab- 
ic auxiliaries of 6t1Tu (optative 6/61 ‘would’; aorist Gb ‘was/were. 
AOR’; imperfect 65 ‘was/were.IMPF’, which, however, may be found 
also in first position); the future auxiliary me/mx ‘will/shall’ (with 
variants xe, xTe, ue, etc.) appearing not earlier than Middle Bulgar- 
ian; present-tense auxiliaries and their successors: ecMb (CbM, CM») 
'am.1pSg', ecu (cu) '2pSg', 3pSg ects (e) 'is.3Sg', ecme (cme) 'are.1pPT', 
ecre (cre) ‘are.2pPl’, caTt (cm, ca) 'are.3pPl. 

The interrogative clitic mu (to the far left in both (4) and (5)) and ad- 
verbial particles are part of the secondary layer. These are: Tu ‘and’, n 
‘and’, maxK(b) ‘again’, Toy ‘then’, etc. The adverbs may show up on both 
sides ofthe primary layer, as indicated by the two positions of ADV in (5). 

The pattern in (5) gives the relative order ofthe elements that tend 
to clusterise as witnessed in the earliest historical texts. With some 
changes, this order correlates with the one that has been observed 
by Zaliznjak (2008, 82) in the Novgorod charters. Note that not all 
elements that appear in bold in (5) are necessarily clitics; as will be 
seen below, some of them are semi-clitics, while others are clitic-like. 
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(5) DISC QU DISC ADV AUX1 DAT (REFL)®ACC AUX2 ADV 


According to the so-called ‘historical principle’, the order of the clit- 
ics in the cluster mirrors the chronology in which they have become 
‘weak’, a pre-condition for their later reanalysis as clitics. This prin- 
ciple predicts that potential members of the clitic cluster would join 
the clustering elements only to their right. According to Zimmerling 
(2013), this principle cannot explain the order but can only account for 
the change in the status of the elements, which is a mechanism for ex- 
panding the cluster. Observations on the data that will be discussed, 
however, show that this is true for a limited set of elements with rela- 
tively fixed slots, either in the clause or in the phrase - the discourse 
markers and the pronominal clitics, which, as mentioned above, con- 
stitute the core of the clitic cluster (while verbal elements fluctuate).° 

The discussion is based on data that has been excerpted from texts 
from three diachronic corpora. For Old Church Slavonic (OCS), these 
are the annotated texts of two monuments - Codex Marianus (CM) 
and Codex Suprasliensis (CSpr) within the TOROT corpus (Haug, Eck- 
hoff 2011a; 2011b), which are used for the numbers; variant readings 
here are consulted according to three monuments - Codex Zogra- 
phensis (CZ), Codex Assemani (CA), and Codex Sabbae (CSb) in the 
TITUS database. For Middle Bulgarian, the texts used are the Leg- 
end of Troy (LTr) (14th c.; Miklošič 1871) and the Vlacho-Bulgarian 
Charters (VB) (15th-16th c.; Bogdan 1902) that are available in the Di- 
achronic Corpus of the Bulgarian Language (Totomanova 2015). For 
Early Modern Bulgarian (EMB), the texts in two damaskins (17th c.) 
are used: the Damaskin of Troyan (DTr; Ivanova 1967) and the Dam- 
askin of Lovech (DLv; Mladenova, Velceva 2013). The texts were cho- 
sen because of their accessibility and availability!° for replica and 
the hypothetical closeness to the vernacular.** 


8 The reflexive pronoun cA ‘self.acc’ exhibits ‘mixed’ behaviour of a marker that re- 
flexivises the verb and a semi-clitic but is also found among the clustering elements as 
we will see later. There are a couple of examples with other accusatives, therefore the 
reflexive is placed within the pattern here, albeit tentatively. 


9 Expectably, some orders that are present in our data are not found in present-day 
standard Bulgarian: for example, pronominal clitics could occupy the first sentential 
position that is never the case in present-day standard Bulgarian (though this phenom- 
enon is observed in some dialects, see Iliev 2018, Tiseva 2008, among other). Also ex- 
pectably, clitics were found in the absolute second prosodic position as well; in some 
cases, they could split a noun phrase much like what happens in present-day Serbian. 
This is no longer the case in standard Bulgarian. 


10 Within the TOROT corpus: http://syntacticus.org/; the TITUS database: htt- 
ps://titus.fkidgl.uni-frankfurt.de/framee.htm?/index.htm; and Diachronic Cor- 
pus of the Bulgarian Language: https://histdict.uni-sofia.bg/textcorpus/list 


11 Texts in CSpr, VB, DTr and DLv have different linguistic properties, probably fol- 
lowing different redactions (and translations) (in the case of CSpr, DTr, DLv; the col- 
lection of VB in the Bogdan's 1902 publication contains texts written over two centu- 
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3.1 Discourse Clitics 


The discourse clitics xe (a focus marker or an emphatic particle) 
and 60 (a complementiser or a marker of subordination) are wide- 
ly attested in OCS (with parallels in Greek; cf. Thayer 1889). These 
clitics usually appear in the second clausal and/or phrasal position: 
they are the earliest 2P clitics and both were already used as suffix- 
es (as in nxe ‘who/which’, HukbToxe ‘nobody’, Hexe ‘than’; oy6o 
‘therefore’, 160 ‘because’, etc.). Migdalski (2016) calls them “opera- 
tor clitcs” since they additionally ‘operate’ on the information struc- 
ture of the sentence. If found together (rarely in the observed texts), 
the order is xe > 60, as in (6a); pronominal clitics follow them (6b-c). 


(6) a. 

ako xe 60 BBNBZE Kb Gy UeHUKOMb CSpr, 251r, 19-20 
if EMPH because entered to disciples 

"Ott pv yap ticfjA0E — mpóc tos pa@ntàg 


i Duetospace limitation, glosses are given if there is no appropriate translation and 
with elements that are in focus in the article, mainly clitics (e.g., is.AUX.3pSg: present- 
tense auxiliary, 3pSg; be.AUx.COND: conditional auxiliary; etc.). 


b. 

oyHbe 60 TU ecTb CM, Mt. 5,29 
better because you.DAT is.COP.3pSg 

cuupéper yap cot 

na norbi6nere eguHb oypd TBOMXb 

to die one members.GEN  your.GEN 

iva amoAntat év TOV LEAVY oou* 


* Three Greek NT editions were consulted. Wherever possible, | supply the 
corresponding Greek text with variants according to other editions if they attest for 
variant readings with respect to the clitics. If there is no reference to an edition, the 
text is given according to Nestle 1904; for a discussion on editions vs. manuscripts, 
see Toufexis 2010. 


c. 
Bb3facTb 60 TU CA CM, Lk. 14,14 
give because you.DAT ReflcL 

avtamoSo8noetat yap cot Nestle 1904 
ávramoóo0noserai &é cot Tischendorf 8th ed. 


ries ago, which reflect different language varieties and influences). The raw numbers 
given in the article show the overall trends without aiming at a thorough analysis as 
the data is quite heterogeneous. 
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BO BbCKpbweHue MpaBegbHbIXb 
in resurrection righteous.GEN 
év tfj davaotdoet THV Sikaiwv. 
év tfj davaotdoet THV Sikaiwv. 


The particle xe is used for emphatic marking of the preceding con- 
stituent and has the semantics of a weak adversative conjunction, 
most often translating Greek dé ‘but, and’ - as in (7a) - and more rare- 
ly pév ‘while’, but also (in various contexts) té ‘and, also’, oùv ‘there- 
fore’, 54 ‘but, then’, yap ‘because’, vé ‘in fact, only’, kai ‘and’. In some 
cases, there is no parallel element in Greek, as in (7b); see also the 
variation in (7c) where oùv is translated as ‘xe oy6o’, only 'oy6o' and 
only ‘xe’ in the three manuscripts cited. 


(7)a 

OHb xe peye UMb Bparb  [...]  CM,CZ,Mt.13,28 

he EMPH told they.DAT enemy 

OHb xe [pe]ue ump Bparb CA 

6 Sè Zon aùtoîc, = ’Ex9pòg 

pa6u xe pbulA eMoy 

slavesEMPH told he.DAT 

Oui xe pBwA [...] 

Oi Sè S50bA01  Aéyouciv abii 

b. 

[...] npuxonurb xe HenpMb3Hb  [...] CM, CZ, Mt. 13,19 
comes EMPH evil-one 

[sl npuxonurb HenpubgHb  [..] 

[.] X &pxetat Ó rtovnpóc Lal 

C. 

erga xeoy6o npngetb rĤb BUHOrpaga [...] CM, CA, Mt. 21,40 

when therefore comes owner X vineyard.GEN 

erga xe npugetb fH» BUHOrpaga [...] CZ 

‘narga oy6o npuge Fb BUHOrpagoy [ec] CSb 

ötav — olv ZA9n ókópio; Tob üuneAOvoc [...] 


TI 
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Both xe and 60 may split a constituent - see (8) where xe separates 
3bro ‘much’ and 30Bzurrroy ‘who is calling (him)' (erux«pácovroc). 


(8) 

3bnoxesoBZ&uTOy MM Éro mHoro ý rnaroJizuuroy CSpr, 21r, 28-29 
much EMPH calling I.pAT he.Acc much and speaking 

'Erukpácovrog 5 uou aUi  Kaidéyovtos TOAAAKIG 


The complementiser 60 (Stawski 1974, 285-6) is predominantly found 
after a single constituent, which can be preceded by a preposition, 
a conjunction or negation particle ne (which can also precede 60). It 
translates the Greek emphatic conjunction ydp ‘because’. The use of 
oy6o as an emphatic particle is a very early development, mostly in in- 
terrogative sentences - oy6o is predominantly found after a wh-word 
(Cejtlin 1994, 721-2) and translates different elements in Greek: dpa 
uèv ‘then indeed’, oùv ‘therefore’, toivuv ‘therefore’ (each has specif- 
ic placement: oùv is predominantly in 2P, and uèv can be found after 
the definite article, i.e. phrase-internally). There are adjacent place- 
ments of oy6o and 60, with variant readings as in (9). 


(9) 

*ko 60 oy60 cb6upamTb — NNBBENDI. CM, Mt. 13,40 
as therefore collect weeds 

Bkoxe 6y60 nnbBeni cb6ipabkTtb CA CA 
KO oyno6b CeÓuWpaERTb — n/TbBe/Ibl CZ 
Worep  oUv OUAAEyetTat — xà áva 


The constituent preceding xe and 60 can be a verbal element, a noun 
or a pronoun, an adjective, an adverbial, a wh-word such as KTO 
‘who’, upto ‘which/that’, no usro ‘because of which’, etc., which can 
be preceded by a conjunction, negation particle, and/or followed by 
MH ‘I.DAT’, TH 'you.DAT', TA ‘you.ACc’, CA 'self.ACC', etc. 

In Middle Bulgarian texts observed here, both discourse markers 
are rare. In LTr, xe has only 9 occurrences, and 60 is found 7 times. 
The raw numbers given by Dimitrova and Bojadziev (2014) show that 
many later non-canonical sources exhibit higher numbers for oy6o 
but not for 60. In VB, 60 is missing and xe occurs 47 times, mainly 
in fixed expressions such as BenuKb xe n Mam ‘big but also small’ 
(26 occurrences) and erre xe (12 occurrences). xe is also found after 
pronominals such as the dative Bu ‘you.2pPl’ in: Bape rro BH zke pe- 
uer xynagH Kp(b)cTb u xynan Xanax ‘something that župan Krastyo 
and zupan Hanush told you’. 
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In the two 17th-century damaskins, xe and 60 are missing (ex- 
cept for in citations).! 


3.2 Pronominal Clitics, Semi-Clitic and Clitic-Like Elements 


The short forms of personal and anaphoric pronouns can be grouped 
into three subclasses according to their cliticisation status in OCS. 
The raw numbers in table 1 support the assumption that they were all 
on the path to becoming clitics (in some of the accounts cited above, 
they are already analysed as such) but there are differences as well. 

The first subclass includes the datives mu ‘I.DAT’ and Tu ‘you.DAT’, 
which are the first pronominals to exhibit the behaviour of clitics: ac- 
cording to Wackernagel (2009), the Greek datives pot ‘I.DAT’ and oot 
'you.DAT' were clitics, hence, their corresponding pronouns in OCS 
mu and Tu might have been clitics, as well (VeCerka 1989; Wacker- 
nagel 2009, 496, fn. 8 by D. Langslow). In the data observed here, 
they tend to appear in 2P (ca. 7396) preceded by a verb (ca. 6796 of all 
2Ps; and 7296 of all occurrences) or after a nominal element, an ad- 
verb, a conjunction such as ga ‘to’, amre ‘if’, bko ‘as’, and a wh-word 
(ca. 2596 of all 2Ps; and ca. 2196 of all occurrences). Although Migda- 
Iski (2016) claims that these were mainly ethical datives function- 
ing as 'operator clitics', a study on these specific cases by Krapova 
and Dimitrova (2015) shows that many of these datives can be ana- 
lysed either as affected participants, or as inalienable possessors, or 
as both (i.e. affected participants that are also inalienable posses- 
sors, see Minceva 1964). 

The second subclass of pronominal clitics comprises the so-called 
semi-clitics and can be further divided into two groups: a) the first- 
and second-person accusatives MA 'LACC' and TA 'you.ACC', which 
appear after a preposition and in first position; and b) the third- 
person accusatives (x ‘he.acc’, x ‘she.acc’, A ‘they.acc’, e 'it. ACC", b 
‘they.acc’), which often translate non-clitic Greek pronouns. Elements 
of both groups show a strong tendency to appear in second position, 
as well as post-verbally. There is a slight precedence of the second 
group to appear post-verbally: ca. 7896 (MA, TA) vs. ca. 93% (1, X, A, 
e, 5). When in second position, however, there is a clear difference 
between the two groups with respect to their pre-verbal placement: 
ca. 6% (MA, TA) vs. ca. 25% (1, x, A, e, 5). 

The third subclass of pronominals - involving pronouns that will 
be defined as 'clitic-like' in the present discussion - includes the 
short dative bi-syllabic emoy 'he.DAT', eu 'she.DAT', nm» ‘they.DAT’, etc., 


12 Inthe so-called archaic damaskins, however, both are still present, in the earli- 
er positions. 
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which have ambiguous behaviour with respect to discourse markers 
and the firsttwo pronominal subclasses, and whose successors in lat- 
er stages of Bulgarian (Moy 'he.DAT', Ài ‘she.DAT’, etc.) are clitics. They 
are also found predominantly in second position (ca. 7596) and, (even) 
more often, post-verbally (ca. 9296). However, only 796 of the clitic- 
like pronouns are both 2P and pre-verbal - this percentage is com- 
parable to the one seen in the preceding paragraph with respect to 
the first- and second-person accusatives MA 'LACC' and TA 'you.ACC' 
(the first subclass of the semi-clitics). 

These results may indicate, albeit it is a tentative proposal, that 
2P cliticisation was the dominant pattern, at least during the period 
witnessed by these OCS texts, and that this pattern was not in com- 
petition with pre-verbal placement but could co-exist with the lat- 
ter in case the word order would allow for it. The 2P pattern and the 
post-verbal one were not in competition either and perhaps can be 
seen as alternative mechanisms motivated also by word order and/ 
or information-structure principles and/or the syntax of the origi- 
nal Greek text. 


Tablei Pronominalclitics, semi-clitics and clitic-like pronouns in OCS 
(according to CM and CSpr) 


CL 2P, pre-V'' 2P, post-V 2P,non-V- non-2P,pre-V non-2P, post-V non-2P, 
adjacent non-V-adjacent 

MM, TU 75° 204 26 12 97 4 

eMOy, en, UMb TT 1053 17 23 362 T 

MA, TA, Hbl, Bbl 83 241 15 22 194 4 

WA,A,0, b 49 788 11 12 278 3 


i Clustering clitics are calculated as part of the cluster (as well as interrogative nu, monosyllabic 
adverbs n, tu, Toy), i.e. another clitic, which is part of the cluster, can precede or follow (e.g. 
npuBegbre mu.DAT n.ACC ckmo ‘bring him to me here’ will be calculated both for the dative mu and 
for the accusative n as 2P, post-V; pbun 0y60 HaM YbTO TU.DAT CA.REFL MbHUT® 'threrefore, tell us 
what you think’ will be calculated for the dative tu as 2P, pre-V). 

ii The constituent in 1P can be preceded by a coordinating monosyllabic conjunction such as u 
‘and’, a ‘but’, Hb ‘but’, etc., and the negation He. Elements after infinitives were calculated accordingly 
(noBent aHeynaTb npuHectu BpbTuwITE. U BbcaguTu U.ACC Bb He. ‘order the proconsul to bring the 
back and to putitinside (in-it)’ was 2P, post-V for the accusative n) but periphrastic verb constructions 
are not included. 

iii One or more constituents (noun phrases, prepositional phrases, adverbial phrases, including 
non-monosyllabic adverbs such as nak», nakbi) can be placed between the clitic/clitic cluster and 
the verb. 

iv Numbers do not involve periphrastic constructions, first-position elements or prepositional 
phrases. 


If clusterised, the dative mu ‘I.DAT’ and Tu ‘you.DAT’ tend to precede 
the accusative semi-clitics (and the reflexives), as in (10a-c). This is 
not true for the clitic-like datives, though, as in (10d). 
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(10) a. 

npusegbre MM n CbMO. CM, Mt. 17,17 
bring IDAT he.acc here 

pépeté pot aùtòv (6e. 

b. 

npuHecbte | MM A cbMo. CM, Mt. 14,18 
bring I.DAT they.Acc here 

npuHecbte A CbMO CA 
npuHecbere cbemo CSb 
MEpETE pot Wie aU toc. 

pépeté pot AÙTOÙÇ úse Scrivener 1894 
c. 

peu4 0y60 HaMb  UbTO TM C^ MbHUTb CM, Mt. 22,17 
tell then we.DAT what  you.DAT think 

eimé oùv fuiv, ti cot Soxet 

d. 

HazHaMeHa xe CA ÉMoy BbHbUb no Tb7oy: CSpr, 72v,4 
indicated EMPH ReflcL he.pAT wreath on body 

’EonudvOn SÈ avtod dotépavog kai Sia Tov coparoc. 


The placement of the pronominal elements often echoes the respec- 
tive placement of the parallel elements in the Greek text but with var- 
iant readings (especially if there were no corresponding construc- 
tions in Greek - Slawski 1946 - such as the reflexive constructions 
and periphrastic verb constructions). The verbal elements are dis- 
cussed further in the article; here I will only mention that clitics and 
semi-clitics demonstrate similar behaviour: both may precede the 
auxiliary and, by transitivity, also the participle (the active l-partici- 
ple as well as the passive n/m/t-participles). Both may be found in 2P, 
in the order X > CL > Aux > Participle, where X can be a wh-word, as 
in (11a), a subjunction such as ga ‘to’, as in (11b), or some other func- 
tional element in first position. 


(11) a. 
Kkako n 6m noroy6unu. CM, Mk. 3,6 
how he.Acc are.AUX.3Sg murdered 


ÖTWG aùtòv dTIOAEOWOLV. 
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b. 


[...] CbBbWUTaAWA ga u6x oy6unu CM, Mar. 3,6 
[...] consulted to he.Accare.AUX.3Sg killed 
[...] éBovAevoavto iva ÀTTOKTEIVWOLV AÙTOV. 


Semi-clitic reflextive pronoun cA ‘self.Acc’ exhibits variation in its 
behaviour just like the other semi-clitic pronominals: it may appear 
after the (reflexive) verb (more often) but also among the clustering 
elements (more rarely) in a slot of another pronominal and, more spe- 
cifically, in the slot of the accusative pronominals. 

We have already distinguished between ‘true’ clitics (Mu ‘I.DAT’ 
and Tu ‘you.DAT’) and clitic-like pronouns (emoy ‘he.DAT’, etc.) above, 
but their different status with respect to the cliticisation behaviour al- 
so manifests with respect to cA ‘self.acc’: Mu and Tu always precedes 
it, as in (10c), while the clitic-like pronouns typically follow it, as in 
(10d) above. 

If the semi-clitic cA ‘self.acc’, however, is found beside the ele- 
ments in the core in (5), as indicated by (10c), it may also appear in 
2P beside other elements as in (12) where it aspires for a slot within 
the cluster. In (21) below, we will see another example where ca is 
placed in front of the auxiliary but also after the main verb in com- 
pound tenses in a variant reading. 


(12) 


KO/IMKO CA 6bi Tpoyaund CSpr, 49r, 11-12 
how-much ReflcL be.AUX.COND worked 
Nooa av Exauec, 


As mentioned above, semi-clitics (reflexives included) are mainly 
post-verbal. Pronominal non-reflexive semi-clitics, though, may also 
appear either post-verbally, in most cases, or in 2P. If 2P, however, 
is occupied by a ‘true’ discourse clitic xe or 60, the semi-clitic is of- 
ten post-verbal, as in the examples in (13). This may signal that these 
semi-clitic pronouns do not behave as clitics yet. 


(13) a. 

MOTOMb Me o6pbre n lic. Bb [ipkme. CM, Jo. 5,14 
later EMPH found he.acc Jesus in temple 

MOTOMb Me o6pbre fic Bb LUPpKbBe CZ 


Metà taita ctpioxe: avtov  d’Incoùg v TG IEP@ 
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b. 

cenMb 60 nxb UMBUA BR XeHA. CM, Mk. 12,23 
seven becausethey.cEN had She.ACC  wife.Acc 

cegmb 60 ‘iMBUA — HK XeHX: CZ 
oi yàp éttà Éoyxov aùtfiv  yuvařka. 

C: 

OHU Me OYKOPULIA M [...] CM, Jo. 9,28 
they EMPH rebuked  he.Acc 

kai éAoi6ópncav aùtòv [...] 

’EXoSopnoav oùv = aUtóv [...] Scrivener 1894 


The inventory of clitics (and clitic-like elements) in OCS texts is rath- 
er limited. It is to be expanded in Middle Bulgarian but some forms 
are already attested in the early texts: for example, contracted forms 
of clitic-like datives, though very rarely and with variant readings, 
as in (14). 


(14) 

‘I Bb UpbKbBe CALITIO Moy’  npunoulA Kb Hemoy CZ, Mk. 11,27 
and in temple being he.DAT came to he.AT 

l Bb upkse XOA ALITIO emoy  npunx Kb HeMOy CM 


kai èv tiep@  neputavoÜvrog aütoU Epxovtal mpóc aùtòv 


The class of pronominal clitics expands in the Middle Bulgarian texts 
that I have considered. In L'Ir, the respective clitics occur in the same 
positions and order with the addition of third-person accusative ro 'he. 
acc’ and dative moy ‘he.DAT’, which behave as ‘true’ clitics. The earli- 
er forms, however (ero and emoy), are still found in LTr but in differ- 
ent functions. The accusative clitic ro ‘he.acc’ is used for direct ob- 
jects only, while the genitive (genitive-accusative) ero predominantly 
expresses the possessive genitive. The dative clitic moy 'he.DAT' re- 
fers to external possessors and is found in 2P, while emoy refers to 
indirect object (and is found after a verb and after some prepositions 
such as rnpoTuBA ‘against’). Expectably, ero and emoy do not cluster. 

The clustering elements exhibit the pattern in (5), as illustrated 
by the examples in (15), with the core (DAT > ACC) in 2P, and the ver- 
bal elements placed to its right (in AUX2). 
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(15) a 

[...] oymbicnutu, Kako ro “ pacbinatu LTr 
[...] think.iNF how he.acc also.EMPH scatter.INF 

b. 

[..] KONTO Moy CA W(Te)ub 66 HapeKknb LTr 
L.] who  he.bAT ReflcL father was called 

C. 

L.] Kakoto Xx cu n npbBoe USD. LTr 
L.] how she.acc be.Aux.2Sg  also.EMPH first heard 

d. 

[L.] na MY e UMA Mpuxia rpagb. LTr 
[L.] to  he.Acc is.coP.3Sg name Paris town 

e. 

[L.] nowHoro Moy 6n nBNOTbi npn6bino LTr 
[...] much he.DAT be.cOND beauty been 

f: 

L.] Ha MM ecu Tbi o6pok». LTr 


[..] but rbAT  is.coP.2Sg you.nom.2Sg oath 


In VB, pronominal clitics continue to cluster in the order DAT > ACC 
(including reflexives). As in (16a), the cluster is placed before the ne- 
gation particle and can be separated by the verb (although both pro- 
nominal clitics and the cluster are already mainly verb-adjacent). 


(16)a 

L.] ga my ra He Nuncat HM eguH  BAac VB 
L.] to  he.paT it.Acc NEG lack no one hair 

b. 

A ako MY ce KpuBO BMBMT Tomy [33] VB 


and if he.pAT ReflcL awry see this.DAT 
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The numbers in table 2 show clitics' overall preference to second po- 
sition and pre-verbal placement in VB. 


Table 2  Pronominalclitics in VB 


2P,pre-V 2P,post-V 2P,non-V- non-2P, non-2P, non-2P, 


adjacent pre-V post-V non-V- 
adjacent 
MM, TM, MOY, U, UM 560 43 21 14 15 2 
raro;io, Wx, me; 237 18 12 8 3 2 
Te, HN, BU 


In VB, a certain number of pronominal clitics are found after a mo- 
no-syllabic conjunction such as a ‘but’, u ‘and’, and after a pause, as 
well as in second prosodic position where it can split a noun phrase. 


(17) 

M MM none cnyra Moa, fa MU e VB 
and l.DAT came servant my to I.DAT is.AUX.3Sg 
Kfnure uo MY CMb Ka3an WT Bce 


bought what he.DAT am.aux.1Sg said from all 


In the two 17th-century damaskins that are took into consideration, 
pronominal clitics are also found in 2P but pre-verbal and post-verbal 
positions are almost equally distributed. Although rarely, the cluster 
may split a noun phrase, as in (18d). The example in (186) illustrates 
the portion of the pattern in (5): DAT ACC AUX2. 


(18) a. 


za énÁub ud wy ce Vizkopeuà cunata d tPóHáubcrBoro — DLv 


for one hour he.bAT ReflcL eradicated power.DEF and courage.DEF 


b. 


M | OcráBM cu ro Ha ounTe n Ha cpiéro, DTr 
and leave ReflPossCL it.acc on  eyes.DEF and on  heart.DEF 


C. 


á nio6oBb He yma HÁuo Mu ce He  npunába. DTr 
and love NEG have nothing I.DAT ReflcL NEG lend 
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d. 

a TorüBa árrn Mý ce THb ABM DTr 
and then angel he.pat ReflcL of-God showed up 

rópt Hat HIETO 


from-above over he.acc 


[s] aM ch Y3mU csoATa  kpena, petó DTr 
(a but ReflPossCL take own towel that 

mi A ch pána Ha pbnb 

I.DAT she.acc  is.AUX.2Sg given on deed 


Table3 Pronominal clitics in DTr and DLv 


2P,pre-V 2P,post-V 2P,non-V- non-2P, non-2P, non-2P, 


adjacent  pre-V post-V non-V- 
adjacent 
MM, TM, MOY, i, 538 418 52 26 49 3 
UM... 
Me, TE, Hbi, Bbi... 147 58 19 4 5 
TO, A, rbi, A... 586 425 44 17 33 2 


To sum up, the inventory of the clitical pronouns that are found in 
second position and pre-verbally increases in the history of Bulgari- 
an, mainly as a result of the changes in the inventory of pronominal 
elements. The accusatives ro ‘he.Acc’, s ‘she.Acc’, rei 'they.ACC' and 
datives Moy ‘he.DAT’, ii 'she.DAT' have joined the cluster after the re- 
analysis of the earlier forms of anaphoric pronouns. 


3.3 Verbal Clitics and Clitic-Like Elements 


Both in the early and in later texts, the behaviour of most monosyllab- 
ic verbal elements such as be-auxiliaries/copulas, future auxiliaries 
mx/ure ‘will/shall’, is ambiguous. They may be found in first position, 
including immediately after negation, and preceding the discourse 
clitics - as in (19), but also in 2P, within a cluster and in second pro- 
sodic position, splitting a noun phrase, as in (19b) - in parallel to the 
orders in the Greek text. 
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(19) a 

66 xe Toy mb marganbini.  [.] CM, Mt. 27,61 
Was.COP EMPH there Maria Magdalene 

"Hv Sè ékeî Mapía fMaySaAnvà [...] 

b. 

HATb Me 65 OTb HUXb 6oyu. [...] CM, Mt. 25,2 


five EMPH Was.COP of they.GEN wise 


; N x x aa 3 
mévite 6& noav ¿E aùtov opóvipot 


Auxiliaries are found either to the left, orto the right of the pronomi- 
nals among the clustering elements, as exemplified by the periphras- 
tic verb constructions in (20). 


(20)a 

[..] ko na 6b! HR nz6asunv Orb Ĥero: CSpr, 280r, 26 
[...] if to be.AUX.COND she.acc saved from he.GEN 

b. 

[..] t Bb Bonx ga n6unoroy6uni. CM, Mk. 9,22 


[.] and in water to he.acc be.AUX.COND murdered 


[...] kai eis Usata, iva droAéon.suBJ abtóv-. 


na 6n CA UXb KOCHAAJI b. CM,Lk. 18,15 
to be.AuX.coND ReflcL  they.Acc touched 


‘iva autav ÖTTNTALSUBJ 


The positions of the reflexive and the auxiliary often vary, with the 
former more often found in 2P, while the latter is adjacent to nega- 
tion, as in (21). 


(21) 

no6pba 6n 6'b1no emoy CM, Mt. 26,24 
better be.AUX.COND been he.DAT 

po6psée émoy 6n 6buno' CA 
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no6pbe emoy 6n 6bino: CZ 
po6p bee 6bino 6br emoy’ CSb 
kañòv ÑV.IND QUT 

aluTe CA 6n He pogunb Ukb Tb. 

If ReflcL be.AUX.COND NEG born man this 

awe 6i He poginb cA ünkb Tb 

áurre CA 6n He pogunb Ukb TE 

ae cA 6bI He pone Unkb Tb 

£L oùk éyevvn@n.IND 6 avapwitoG &kelvoc. 


In Middle Bulgarian, fluctuations in the positions of auxiliaries re- 
main, as exemplified in (22) from LTr. 


(22) a. 

n nongn nporuBx EKTOpoy Kpano, ga LTr 
and came against Hector king to 

6n CA OCTaABUNb rpbukbia X BOMCKbi 

be.AUX.coND  ReflcL left Greek army 

b. 

[...] n MeTHA npbiü oapwxa eroBa,  LTr 
[...] and threw in-front-of Paris his 

paBHo CA 6n Bb3BpaTUNb 

long-ago ReflcL — be.AUX.COND come back 


In VB, monosyllabic auxiliaries are mostly in AUX2, i.e. to the far 
right, where the latest clustering elements were joining, presuma- 
bly, as in (23a-b), with 6 examples of the reverse order of which 5 
are with cmo 'are.1pPl' and one - with the conditional auxiliary 6u 
"would' and negation particle - see (23c) (and 13 instances are with 
future auxiliary). 


(23)a 
[L.] epe He mn e Y3en  BpaxMau! rocnogctBa mu VB 
[.] that NEG I.DAT is.AUX.3Sg taken enemy master my 
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b. 

a npbrapu 3ampuxb TU BO6UTOKL, VB 
and burghers shut you.DAT cattle 

Ta me cy monun(u) MeHe. 

thus I.ACC Are.AUX.3PI asked l.ACC 

c. 

n 3a moy cnyx6y He 6n MY Muno VB 


and for my service NEG be.copP.conD he.DAT dear 


vs. 
ako MY He 6n natu WT BONA. VB 
if he.DAT NEG be.AUx.COND paid from will 


The trend continues into the Early Modern Bulgarian with auxilia- 
ries placed on the far right (including the present-tense ones), as in 
(24a-c), in contrast to the situation in present-day standard Bulgarian 
(compare also their positions in the dialects according to Antonova- 
Vasileva et al. 2016, map 144 where the majority of dialects instan- 
tiates the order AUX > DAT / ACC). There are again single instances of 
verbal clitics found before the pronominal ones, as in (24c). The or- 
der with respect to the reflexive, as in (24c), is the same as in pre- 
sent-day standard Bulgarian. 


(24) a. 


3aujo ro [47] MbUUNb, N  3auió ro [47] Mcmenunb DTr 
why he.acc is.AUX.2Sg tortured and why he.Acc is.AUX.2Sg driven away 


b. 


ak! Cb üQcrotáBane npáBo ra e &a rane DLv 
if_are.aux.3pPl defended rightly then_it.Acc_are.Aux.3pPl eaten 


C. 


3aroBá ca e BekE v_nokadHie DLv 
therefore_ReflcL_is.Aux.3pSg already also. repentance 
ZaroBá ca ce BeRE v_pénckbie 


therefore_are.Aux.2pPI_ReflcL already also_heavenly 
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d. 


a'MÀ cM €a OMpbcne MmoMÁH'cTBOTO npbai cBáÓ'ara DLv 
but ReflPosscL are.AUX.3pPl sinned virginity.DEF before wedding.DEF 


e. 


ToBá ca Hero cM Hb Cb nagune [...] DLv 
This are.COP.3pPl that ReflPosscL wEG are.AUX.3pPl kept 


Negation may separate clustering elements, with the reflexive to the 
left and the auxiliary to the right to form two prosodic words (whose 
constituents are linked via ' "), as in (25a) (see also (24e) with the 
possessive reflexive). Pronominal clitics are in second position while 
auxiliaries are verb-adjacent, as in (25b-c). 


(25) a. 

d Ouije ce Hb é po6pb  [mfkaHambctun]b [...] DLv 
and_yet_ReflcL NEG is.AUX.3pSg well here arranged [eel 

b. 

3amb gorgè BM Hangt Hb_è mpbBapuna cá6iía cbMmpbrHà DLv 


for until you.DAT before NEG_is.AUX.3pSg overtook sword deadly 


c. 
4 nornà ca nazápb Hb é pazBánwnb. DLv 
and until ReflcL market NEG is.AUX.3pSg broken 


Table 4 gives the order of clustering be- and future-auxiliaries (the lat- 
ter in Middle Bulgarian and Early Modern Bulgarian) and pronomi- 
nal clitics, semi-clitics, and clitic-like elements with the periphrastic 
verb constructions. The numbers attest for variation in the placement 
of auxiliaries among the clustering elements with preference to the 
right slot AUX2 of the pattern in (5). This is also true for the future 
auxiliary (the form mx was attested very early: Mircev 1956, 202; 
Haralampiev 2001, 149). In the texts here, the position of mx is al- 
so ambivalent (in AUX1 and AUX2), but it is negation-adjacent and al- 
so found in second prosodic position. 
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Table4 Pronominals and auxiliaries with periphrastic verb constructions 


Pron > Aux Aux > Pron Non-adjacent 
OCS 53 18 18 
MB 143 10 2 
EMB 103 25 4 


The clitic cluster in the noun phrase is left outside the scope of the 
present article but its phrasal position (2P) and order is very similar 
to that of the clausal cluster, with the earliest clitics to the left and 
the latest to the right (for comparison with Greek, see Eckhoff 2018). 
This cluster may not be considered viable in present-day Bulgarian if 
we assume that the definite article is an affix. The presumed pattern 
of the noun-phrase clitic cluster might have been DISC > DEM/ART > 
POSS/GEN/DAT > (AUX). There are very early examples of demonstrative 
pronouns that are in second position in the noun phrase and imme- 
diately follow a nominal element in first phrasal position: these are 
exactly the demonstratives that may be interpreted as clitic-like ar- 
ticles (Dimitrova-Vulchanova, Vulchanov 2012). However, there are 
also examples of discourse markers followed by demonstrative pro- 
nouns, clusterised in the same phrasal second position, which attests 
that the reanalysis has not been completed yet (as per Kurz 1963). 


4 ATentative Conclusion 


The data that have been discussed in the article backs the assump- 
tion that, historically, there might have been no significant changes 
in the position of the elements in the core of the primary clitic clus- 
ter, i.e. the pronominal clitics, with variations happening (and still) 
among the elements in the verbal periphery. The earliest 2P clitics 
were lost, somewhat expectably, because they were non-paradigmat- 
ic unlike those that have remained. The set of elements filling the re- 
spective slots among the clustering elements was expanded as a re- 
sult of the restructuring within the pronominal and verbal systems. 
Similar developments might have been at play for the (partial) loss 
of the noun-phrase cluster. A more thorough analysis of the variant 
readings across the monuments and constructions (matrix vs. subor- 
dinate clause, clause types, absolute constructions) is due in order to 
highlight the numerous intricacies of cliticisation and clitic patterns 
in the history of Bulgarian. 
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1 Introduction 


Torlak is a non-standard Old-Shtokavian Slavic variety spoken in the 
southern or southeastern area of Serbia and in the bordering are- 
as of Bulgaria and Macedonia (see fig. 1 below). It is also called the 
Prizren-Timok dialect in the attempt to delimit its distribution with- 
in the boundaries of Serbia. This variety has recently started get- 
ting more attention from the scientific community, mainly dealing 
with a variety of phenomena related to the central and north-east- 
ern Timok area.’ So far, the literature has focused on the postposi- 
tion of the article and on the use of pronominal clitics, claiming that 
Torlak is a transitional area having both Balkan and non-Balkan fea- 
tures. For instance, it has second-position clitics as Serbo-Croatian, 
but makes consistent use of direct object clitic reduplication as Bul- 
garian and Macedonian.’ 

When it comes to the use of articles, the Trgoviste-Torlak varie- 
ty presents an overt postposed t-type particle deriving from the de- 
monstrative pronoun taj/ta/to ‘that’, for instance: 


(1) a. Bulgarian 


dete-to 
child.DET 
‘the child’ 


b. Albanian 


ribar-at 
fisherman.DET 


’ 


‘the fisherman 


c. Romanian 
Zena-ta 

woman.DET 
‘the woman’ 


The particle can also enclitisize to the (first) adjective, for instance 
in (2), whereas the presence of a demonstrative pronoun incorporates 
the determiner as in (3). 


1 Runic 2013a; 2013b; Vuković, Samardžić 2015; Makarova et al. 2020; Vuković 2021, 
among others. 


2 Cf. Runi¢ 2014, 11-94 for an overview of the clitic system; Zivojinovié 2021 for a com- 
parative perspective on the use of clitics in Torlak. 
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(2) 

ubavo-to malecko dete 
beautiful.DET little child 
‘beautiful little child’ 


Ja gu Cuvam kako golupce, u 
I her guard/raise.PRES.1SG as pigeon in 
onuj buljinu što kuka cel 

that owl that complain/cry.PRES.3SG entire 

a ona izraste 

but she grow.PRF.3SG 

noc 

night 


‘I raise her like a pigeon (kindly), but she grew into that owl that cries all night’ 


The productivity and distribution of such particles seems to be con- 
siderably subjected to diatopic variation. Indeed, the literature on 
postposed articles and DP structures in the north-eastern and cen- 
tral areas shows instead some inconsistencies; for example, Runic 
(2014, 66) suggests that Torlak? is an article-less variety, whereas 
Vuković and Samardžić (2015) present the distribution of the post- 
posed overt article. However, the former study is based on a data col- 
lection in urban areas, such as the central Nis and Leskovac area, 
whereas the latter relies on fieldwork data gathered in north-east- 
ern isolated and urban areas of the Timok Valley. Indeed, Vuković 
and Samardžić (2015) show that the postposed article^ is productive 
in isolated rural areas (e.g. dubre-to 'the garbage") that are less in- 
fluenced by the Serbo-Croatian superstratum. 

Our research presents a contribution in the study of the postposed 
articles and DP structures by providing novel data from the south- 
ern sub-variety of Trgoviste, which borders Macedonia and Bulgaria 
(see the indication in fig. 1). The data was collected in the rural are- 
as of Trgoviste through the recordings of natural speech of 8 partic- 
ipants whose age range is 70-90. 


3 PTS (Prizren-Timok Serbian) in Runié's terms (2014). 


4 Specifically, Vuković and Samardžić (2015) argue that the Timok area maintains all 
three types of articles, namely the distal t- and n- types, deriving from demonstrative 
pronouns taj, ta, to and onaj, ona, ono 'that' as in brat-at 'the brother/that brother' and 
vrata-na 'the door/that door' respectively, and the proximal v-type, as in baba-va 'the 
grandmother/this grandmother’. 
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Figure 1 The overall distribution of Torlak 


This investigation is an additional puzzle piece to Stankovic (2013) 
who scrutinises three Torlak isoglosses® along with Macedonian 
in terms of BoSkovic’s generalisations (2008), showing that DP/NP 
partition is not a sustainable description for these varieties. What 


5 Stankovic (2013) relies on the partition of Torlak varieties provided by Ivić 
(1994), who distinguishes the following isoglosses: Kosovsko-Resavski, Prizrensko- 
Juznomoravski, Svrljisko-Zaplanjski and Timocko-Luznicki. Trgoviste-Torlak is located 
within Prizrensko-juznomoravski, an article-less variety (cf. Ivié 1994). 

ALL and LWA indicate article-less languages and languages with articles respec- 
tively. Trgoviste-Torlak is located within Prizrensko-juznomoravski (PJ). 
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Stanković (2013), who assumes that the southern Torlak isogloss is 
an article-less variety, proposes is rather a phonologically null vs. 
saturated DP structure on the basis of a set of structural and lexical 
properties which the isoglosses may or may not share. 

In this article we will address the postposition of articles by test- 
ing the nature of the Torlak enclitics and we delineate its domain by 
framing Trgoviste-Torlak within the Balkan Slavic context. Paragraph 
2 of this article illustrates the existing proposals on the Balkan DP 
structure, which mainly focus on Bulgarian and Macedonian data. 
Paragraph 3 tests the nature of the Torlak article-like particles fol- 
lowing Halpern (1992), whereas paragraph 4 provides further clues 
on the Torlak DP layer. Finally, paragraph 5 presents a brief compar- 
ative analysis with respect to the multiple determination phenome- 
non as opposed to Bulgarian and Macedonian and provides a note on 
the development of the Torlak article. 


2 State of the Art 


The issue surrounding the structure of postposed article has been a 
central topic in the generative literature." Accordingly, several pro- 
posals were put forward in order to explain the structure of the DP 
in the Balkan languages. 

In the following paragraphs, we will revise some of the most influ- 
ential assumptions regarding the DP in Balkan Slavic essentially re- 
ferring to Balkan Slavic languages, i.e. Bulgarian and Macedonian, 
to provide an appropriate starting point for the analysis of our Tor- 
lak data. In addition, some considerations on the nature of the post- 
posed article will follow. As such, we will mostly build on Halpern 
(1992) and Franks (2001) to shed light on the affixal properties ofthe 
postposed article found in Trgoviste-Torlak. 


2.1 Early Proposals on the Balkan DP 


The extensive bulk of literature on postposed articles has established 
a clear relationship between the presence of this affixal-like marker 
of definiteness in languages like Romanian, Albanian, Bulgarian and 
Macedonian to the Balkan Sprachbund, for instance: 


6 Tomić 1996; Dimitrova-Vulchanova, Giusti 1995; 1998; Franks 2001; Rudin 2018a; 
2018b for Balkan Slavic; Cornilescu 2016 for Romanian, among others. 
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(4) a. Bulgarian 
kniga-ta 
book.DET 

‘the book’ 


b. Albanian 


krevat-i 
bed.DET 
‘the bed’ 


c. Romanian 


baiat-ul 
boy.DET 
‘the boy’ 


Nonetheless, the assumption on whether the postposed article can 
be considered an actual inflectional marker has been widely debated. 
We will return to the affixal nature of the postposed article with spe- 
cific reference to Torlak in paragraphs 3, 4 and 5. However, it seems 
crucial to point out two major lines of research that consider post- 
posed articles either as (i) proper clitics (Scatton 1980; Tomić 1996) 
or (ii) inflectional affixes (Halpern 1992; Franks 2001; Rudin 2018a; 
2018c; Embick, Noyer 2001 among others). 

Tomić (1996), for instance, argues for a clitic-like nature of the ar- 
ticle, which is to be found in a Wackernagel position within the DP. 
She claims that the postposed article is generated on the D°, trigger- 
ing the movement of the highest, leftmost head to SpecDP. The move- 
ment of the noun, or whichever element follows the clitic, be it an ad- 
jective, numeral, or possessive, is considered a type of transformation 
applying from D-structure to S-structure in pure government and 
binding terms. In other words, the article as a nominal clitic is gen- 
erated in D° and triggers the movement of N to SpecDP, which ends 
up in a spec-head relation with the article. However, even though 
Tomić (1996) assumes that nominal clitics, i.e. articles and posses- 
sive clitics, are Wackernagel clitics, she also admits that they seem to 
show some typical properties that are normally ascribed to affixes.’ 

For this reason, Dimotrova-Vulchanova and Giusti (1998) dis- 
pensed with N-to-D movement,? arguing that Bulgarian does not dis- 


7 Ananonymous reviewer also points out the fact that, if we were to follow the idea 
that the article is merged in D, coordinated Ns would display the article only on the 
first noun contrary to fact. 


8 See also Bošković 2019 for a more detailed explanation to discard N-D raising in af- 
fixal article languages. 
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play any instance of movement of the noun to a higher position, as 
opposed to the optional movement of N, which is found in both Alba- 
nian and Romanian. This assumption allows the authors, on the one 
hand, to rule out the movement of N to D in the narrow syntax and, 
on the other hand, to rely on the movement of N to D at LF in order to 
check definiteness [Def] features. Franks' (2001) proposal goes along 
these lines, assuming movement ofthe highest head below D at LF in- 
cluding AP, which, following Abney's (1987) account, dominates NP.? 

These two accounts, therefore, provide a solid basis to consider 
the article in Balkan Slavic (and non-Balkan Slavic languages) a prop- 
er affix, regardless of their assumptions related to movement oper- 
ations taking place at LF. 


2.2 Current Assumptions on the Structure of the Balkan DP 


Abstracting away from the proposals that were analysed in $ 2.1, we 
now review Rudin's (20182; 2018b) assumptions as a starting point to 
better capture the structure of the Torlak DP. For now, we will con- 
sider the relationship between D? and X? as an agreement relation 
for definiteness, bearing in mind the existence of different propos- 
als (Koev 2011; Petroj 2020 among others) which argue for phi-fea- 
tures and definiteness agreement.'? 

Considering the structure in (5), Rudin (2018a; 2018b) adopts an 
Abney-type of structure in which the AP dominates the NP." She 
claims that D, being phonologically null, bears definiteness features 
that enter in an agreement relation with the articled word, namely 
the first head below the DP. 


9 InDimitrova-Vulchanova, Giusti 1998, the AP moves entirely to SpecDP and it checks 
[Def] features within the AP projection. 


10 Ina slightly different way, Giusti (2015) argues for the absence of a [Def] fea- 
ture, introducing a scattered head bundled with other features proper to the nomi- 
nal group, which could explain the presence of the inflectional article especially in the 
case of Romanian. 


11 An anonymous reviewer points out that the head status of adjectives in struc- 
tures like (2) is challenged by the fact that it fails to describe cases like the following: 


[op [p glavna-ta po značenie] pricina] Bg. (Cinque 2010, 47) 
Lit. ‘the main in importance reason’. 


We acknowledge Cinque (2010) in asserting the phrasal status of adjectives and their 
adjunction to the NP. However, we believe that, for the purposes of this paper, both ac- 
counts, i.e. AP dominating NP or AP adjunction to NP, allow us to describe the behav- 
iour of definiteness agreement with respect to post-posted articles in these Balkan Slav- 
ic varieties. Rudin (2018a) also points out that, even by adopting the perspective of ad- 
junction, definiteness agreement still appears on the highest leftmost head, namely A. 
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(5) 
DP 
D AP 
[+Def] AN 
A NP 
novi-te telefoni 


As it is shown above, the agreement relation between the adjective 
and the [Def] feature allows us to explain the presence of the inflec- 
tional article on the highest head below D, in this case, AP. Follow- 
ing the idea put forward in Rudin (2018a; 2018b), we could try to de- 
scribe definiteness agreement through the operation Agree (Chomsky 
2000; 2001) by adopting a bidirectional approach (see for instance 
Baker 2008; Bjorkman, Zeijlstra 2019 among others). 


(6) 
DP 
D XP 
MiDef] 
X+art 
[uDef] 


As show in (6), the uninterpretable [Def] feature acts as a probe for 
the interpretable [Def] feature on D; it therefore looks up, as an in- 
stance of upward Agree, and checks its [uDef].!? 

According to these theoretical premises, in principle, it could be 
possible to apply the structure in (6) to the Torlak DP. However, be- 
fore turning to the analysis of the DP of this Balkan Slavic variety, 
some considerations on postposed articles as inflectional markers 
for definiteness are needed. 


12 Franks (2020) proposes to consider this type of agreement in terms of feature shar- 
ing, where D assigns definiteness features to XP that, in turn, shares them with the head 
X. As such, this account considers definiteness agreement as a more morphological pro- 
cess than the operation Agree and crucially as a more local relation. 
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3 Testing the Nature of the Enclitic Article 


In the previous subsections, we mainly argued for the presence of 
an affixal-like postposed article specifically for Bulgarian and Mac- 
edonian. At this point of the analysis, it is necessary to understand 
where Trgoviste Torlak stands concerning the nature of the enclitic 
particles and whether the assumptions that were put forward for Bul- 
garian and Macedonian still hold for this Balkan Slavic variety. The 
analysis covering our Torlak data is carried out below, after provid- 
ing a quick review of the main phonological arguments to consider 
postposed articles as actual suffixes as outlined in Halpern (1992) 
and subsequently applied to Bulgarian in Franks (2001). 

Halpern (1992) proposes four "tests" to validate the hypothesis 
that articles are actual suffixes, i.e. final-devoicing as shown in (7), 
consonant-schwa metathesis as shown in (8), changes in the place- 
ment of the stress as shown in (9) and the appearance of a special 
form of the stem in the articled word as shown in (10). 


(7) a. Bulgarian 


bratovéed [bratoféet] 
‘cousin’ (Franks 2001) 


b. Bulgarian 


bratovéedat [bratoféedat] 
cousin.DET 
‘the cousin’ (Franks 2001) 


(8) a. Bulgarian 

grak* 

‘Greek’ (Franks 2001) 

* The letter represents a schwa. 


b. Bulgarian 

gárkát 

Greek.DET 

‘the Greek’ (Franks 2001) 


(9) a. Macedonian 


brAtuced 
‘cousin’ (Tomić 1996, 531-2) 
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b. Macedonian 


bratUcedot 
cousin.DET 
‘the cousin’ (Tomić 1996, 531-2) 


(10) a. Bulgarian 
interesen grad 
interesting city 
‘the interesting city’ 


b. Bulgarian 
interesnijat grad 
interesting.DET city 


‘the interesting city’ 


These tests, carried out by Franks (2001), clearly show the affixal 
nature of the post-posted article in Bulgarian ruling out their clitic- 
like nature. Building on these assumptions, it is now crucial to un- 
derstand whether the Torlak article displays any difference with re- 
spect to the neighbouring Slavic languages. 

Firstly, final devoicing is a systematic phenomenon in Trgoviste- 
Torlak, as in brod-brot ‘ship’, grad-grat ‘town’, Glog-Glok,** leb-lep 
‘bread’. Devoicing is blocked by the addition of the suffixal element 
that forms CVC syllables, for instance grat ‘town’ vs. gradat ‘the town’. 

The second test determined by Halpern (1992) is more problem- 
atic, as confirmed by Franks (2010, 111-13), who instead argues that 
the metathesis is the result of a schwa epenthesis (see (4) above). On 
this note, we do not identify relevant examples in our Trgoviste-Torlak 
corpus either related to (ii) or (iii). Nonetheless, it is worth noting 
that 34 syllable words containing a postposed t- particle bear the 
stress on the antepenultimate syllable, as in trAktor-at ‘the tractor’. 

The word length however plays a role, thereby allowing us to apply 
Halpren's fourth test, which broadly affirms that the masculine form 
of the article only attaches to long adjectival stems. This is clearly 
visible in adjectives such as ubav ‘beautiful’, or nov ‘new’, which do 
not allow forms such as *ubav-at muž or *nov-at stol, but require a 
restructuring of the syllable arrangement in the adjective, obtaining 
ubavijat muz ‘the beautiful husband’ or novijat stol ‘the new table’. 

The application of the tests related to the nature of the enclitics 
provides therefore a partial outcome. By claiming such results, we 
intend that the t-particle may not be a fully functional suffixal ele- 


13 Glogis a village located in the area of Trgoviste/Vranje. 
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ment, but it could be retaining some degree of lexical value, hence 
the absence of the particle in examples containing a demonstrative 
(cf. (3) illustrated above). 


4 Looking for the Torlak DP Layer 


Coming back to the analysis of the Balkan Slavic DP, it seems clear 
that the assumptions that we put forward in $ 2.2 may be challenged 
by the behaviour ofthe Torlak article with respect to Halpern's tests. 
As we previously argued, these articles partially fit the tests, show- 
ing different outcomes compared to the Bulgarian and Macedoni- 
an ones. Considering these results, it may be useful to go deeper in- 
to the analysis of the Torlak DP, testing whether this understudied 
variety fits BoSkovic’s (2008; 2012) diagnostics as an NP or DP lan- 
guage. Tasseva-Kurktchieva and Dubinsky's (2018) study already ap- 
plied some of BoSkovic’s (2012) 18 diagnostics on Bulgarian and their 
results showed that this Balkan Slavic language falls in a category 
that is neither the one of full DP nor a full NP language. Their propos- 
alis that Bulgarian is, in fact, a weak DP language that projects the 
DP layer only in the presence of the definite article. Following from 
their results, we applied the same diagnostics, checking the behav- 
iour of the structures listed in table 1, to Trgoviste-Torlak in order 
to find out its status and the behaviour of the DP projection in this 
understudied variety. 


Table1 Syntactic contexts based on Bošković 2008 
i) Clitic doubling 

(ii) Left-branch extraction 

(iii) Negation raising 

iv) Scrambling 

(v) Presence of majority superlative reading 

(vi) Superiority and Multiple Wh-Fronting 

vii) Adnominal Genitives 

(viii) Head-Internal Relatives and Locality 


Specifically, clitic doubling and the related generalisations have been 
widely discussed in Živojinović (2021), claiming that Trgoviste-Torlak 
stands in a compromise position showing overt postposed articles 
and allowing clitic doubling, but having Wackernagel-type clitics (11). 
On this matter, Bošković generalises that (i) second position clitic 
systems are only found in NP languages, (ii) only languages with ar- 
ticles may allow clitic doubling, (iii) there is no clitic doubling with 
second-position clitics. 
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(11) 
Odamna ga upozna Milovana. 
long_time_ago him.CL.ACC met Milovan 


‘I met Milovan a long time ago’. 


When it comes to left-branch extraction, Trgoviste-Torlak does not al- 
low such structures (12), following Boskovic (2008) who affirms that 
only languages without articles may allow LBE. 


(12) 
*Maleckoto vido dete. 
little.DET saw child 


While Torlak does not allow scrambling, as predicted for languages 
with articles, the Neg-raising test requires some further explanation. 
Precisely, Bošković (2008) claims that the negation in a matrix clause 
negates the content of a subordinate clause and in such contexts it 
makes use of a licensed negative polarity item. However, Stankovic 
(2013) observes that languages without articles, such as Serbo-Croa- 
tian, exhibit Neg-raising, but do not license an NPI with verbs such as 
believe. This observation can be extended to Trgoviste-Torlak as well. 


(13) 
Ivan ne veruje da Bog[k] postoji. 
Ivan not believe that God exists 


‘Ivan does not believe that God exists’. (Cf. Stanković 2013) 


Trgovište-Torlak does not show the majority superlative reading un- 
like varieties such as Slovenian (for instance, Največ ljudi pije pivo 
‘Most people drink beer’). Similarly, it does not show strict superior- 
ity effects to multiple wh- fronting, an expected feature of varieties 
with articles (e.g. Koj koga vidi? / Koga koj vidi? lit. Who whom sees/ 
whom who sees). On the same line, our Torlak subvariety does not 
provide examples of two adnominal genitives and only allows head- 
external relatives. 

Once again, the overall position of Trgovište-Torlak seems to be a 
compromise one, partially fitting Bošković’s (2008) generalisations and 
presenting features belonging to article, but also article-less varieties. 
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5 Discussion and Further Remarks 


The discussion on definite articles in Torlak and the analysis of the 
previous sections allows us to make some further remarks, specifi- 
cally concerning the comparison among Bulgarian, Macedonian, and 
Torlak. An interesting phenomenon worth discussing, which interests 
mostly Bulgarian and Macedonian, falls under the name of Multiple 
Determination (MD). The presence (or absence) of MD provides us 
with the opportunity to shed some light on the properties of the suf- 
fixal articles found in the three languages in question. The frame- 
work that we adopt follows Rudin (2018c) with the addition of a nov- 
el proposal aiming at explaining the anomalous behaviour of Torlak 
compared to the other neighbouring varieties. Namely, we will call 
into question the process of partial or total grammaticalization of 
the article and its outcome in the different languages. 


5.1 Multiple Determination 


As anticipated, one ofthe phenomena that is worth analysing to better 
understand the characteristics and peculiarities of the DP in Torlak 
is Multiple Determination (MD). MD is defined as the presence, inside 
the DP layer, of a double or multiple realisation ofthe DP in certain en- 
vironments (Alexiadou 2014). There are different hypotheses that try 
to explain the functioning of MD, two of the most relevant are (i) the 
split DP-Hypothesis and (ii) the 'distributed' DP-Hypothesis. Accord- 
ing to the first one, the DP is divided into at least two layers that con- 
tribute differently to the meaning of the structure. This distinction is 
between a part where deixis is encoded and another part where deter- 
mination is: [DP1 Deixis [DP2 Determination]]. According to the oth- 
er hypothesis, instead, the Det can realise several other non-D related 
projections within the extended projection of the noun. 

In the Balkan Slavic languages analysed, namely Bulgarian and 
Macedonian, MD is characterised by the presence of a Demonstra- 
tive and one or more definite article suffixes (Rudin 2018c), like the 
examples in (14) show: 


(14) 
a. Bulgarian 

tija novi-te telefoni 
these new.DET phones 


‘these new phones’ 
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b. Macedonian 


tie ubavi-te fustani 
these pretty.DET dresses 
‘these pretty.DET dresses’ 


According to Rudin (2018c), in Bulgarian and Macedonian MD is used 
in colloquial speech and has a specific pragmatic reading that can be 
judged either positively or negatively by the speakers. However, as it 
will be demonstrated below, the situation concerning MD in Torlak is 
dramatically different. In particular, any instance of MD is judged un- 
grammatical by speakers of this language. Consider for instance (15). 


(15) Torlak 
*ovoj ubavo-to dete 
this beautiful.DET child 


‘this beautiful.pET child’ 


Going back to the structural analysis of the DP, we assume that in case 
of MD, the Dem is located higher up in the structure. Furthermore, 
we hypothesise that in Bulgarian and Macedonian there is a split in 
the DP and that the features that enter in an agreement relation are 
those present in the D head while those belonging to the Dem are on- 
ly deictic. As such, since the featural content of the two heads does 
not interfere with one another, MD is allowed (a). In Torlak, instead, 
there is no split in the DP between the features of the Dem and of the 
D head and for this reason agreement is possible only from one head 
at a time (b). The two structures in (16) are taken from Rudin (20180). 


(16) 
a. 
fr 
jas 
Demonstrative 
D 
| 
*Def 
X+art 
» «Def Y(*art) 


# 


» +Def 
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DemP 


Dem DP 


demonstrative 


5.2 Grammaticalization Hypothesis 


The peculiar characteristics of the DP in Torlak, which seems to be- 
have differently with respect to other Balkan Slavic languages like 
Bulgarian and Macedonian, can be explained as caused by a process 
of partial grammaticalization. The grammaticalization path that is 
taken into consideration here is the one used to describe some Scan- 
dinavian varieties too, for instance Nynorsk Norwegian, namely con- 
tent item > grammatical word > clitic > inflectional affix (Faarlund 
2018, 618, the original theoretical framework from Hopper, Trau- 
gott 2003). In the case of Macedonian and Bulgarian, the articles 
seem to have undergone a full grammaticalization into purely func- 
tional elements and for this reason they are able to occur with a de- 
monstrative in a MD construction. In Torlak, instead, the grammat- 
icalization of the Det is only partial resulting in an inflectional affix 
that maintains the demonstrative semantics. We argue that the pe- 
culiarity of the situation in Torlak is that the Dem and the Det share 
the features that are semantically encoding part of their meaning, 
namely the demonstrative ones. This sharing is not allowed because 
the two heads are not differentiated enough due to the grammatical- 
ization of the determiner, that is only partially undergone in Torlak. 
This claim is also supported by the results of the tests carried out in 
the previous sections, which confirm a compromise position of the 
article.! In the case of Bulgarian and Macedonian, instead, the com- 
plete process of grammaticalization renders the Det a purely func- 
tional element that does not enter into competition with the Dem for 
the semantic features encoding [+Dem] and hence the construction 


14 An anonymous reviewer asks whether Torlak affixes encode deictic differences 
as the Macedonian article and the variety of Bulgarian spoken in the Rhodope moun- 
tains. As we previously argued, Trgoviste-Torlak only displays a t-type of article; how- 
ever, some other varieties of Torlak (cf. Vuković, Samardžić 2015 for the Timok area) 
may retain this distinction. 
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allows for MD. Being grammaticalization a gradual process, even in 
Bulgarian and Macedonian it is possible to find articles that have not 
completely undergone semantic bleaching. 


6 Conclusion 


In this paper we attempted to provide a contribution to the study 
of the Balkan Slavic postposed article by providing novel data from 
an understudied non-standardised variety. The study highlights yet 
again the transitional nature of Torlak that balances Balkan and non- 
Balkan features. Indeed, Torlak and specifically Trgoviste-Torlak pre- 
sent a postposed suffixal article-like element, but they do not allow 
the use of multiple determination. We argued that this behaviour with 
respect to MD is explained by a lack of a split in the D features in Tor- 
lak, which is instead present in the neighbouring Bulgarian and Mac- 
edonian. To conclude the article, we noted that the peculiar behav- 
iour of the DP in Torlak could be due to a partial grammaticalization. 


Abbreviations 


ACC Accusative 

cL Clitic 

CVC Consonant-vowel-consonant 
DET Determiner 

DP Determiner phrase 
LBE Left branch extraction 
NP Noun phrase 

NPI Negative polarity item 
PRES Present 

PRF Perfect 

SG Singular 


XP X phrase 
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1 Introduction 


Although migration from both pre- and post-Soviet Russia is polyeth- 
nic, this aspect has been of interest to researchers mainly in rela- 
tion to three groups: ‘Russian Germans’, ‘Russian (Soviet) Jews’ and 
ethnic Russians (see, e.g., Ben-Rafael et. al. 2006; Isurin 2011; Dietz 
2000). Representatives of other ethnic groups were usually consid- 
ered as an integral part of the undifferentiated mass of ‘Russian’ em- 
igration, which in the ordinary sense means not only citizens of Rus- 
sia, but also all immigrants from the former Soviet Union in general 
(hereinafter FSU).* Non-Russian ethnic groups from this region, es- 
pecially in the post-Soviet period, mainly attracted scholarly inter- 
ests of cultural anthropologists, ethnologists and sociologists. Al- 
though their research touched upon language issues as well (see, 
for example, Guchinova 2004), linguistic works concerning ethnic 
(minority)? languages of Russia in the migration context are scarce, 
compared to the countless literature on the Russian in migration (see 
e.g. Anstatt 2012; Besters-Dilger 2013; Glovinskaya 2004; Zemskaya 
2001). These few linguistic and sociolinguistic works on non-Russian 
languages and migration address topics of language and identity (Yu- 
supova, Nabiullina 2016; Bedretdin 2017; Khilkhanova 2017), struc- 
tural and functional features of Tatar and other languages in foreign 
diasporas (Akhmetova 2004; Nabiullina, Yusupova 2015; Khilkhano- 
va 2020; Bitkeeva 2018), mechanisms and consequences of language 
contacts with host country languages, e.g. Chinese and English (Na- 
biullina, Yusupova 2014; Yusupova et. al. 2013; Aliev 2017). 


The Author thanks many people and organisations who contributed to this research. 
First, the project could not be done without the cooperation of migrants from Russia 
and Kazakhstan. Though I cannot name them here I nevertheless would like to thank 
all the anonymous informants who shared their life stories and sincerely answered 
my questions. 

I would also like to express my gratitude to Dr. Günther Schlee and the staff and re- 
searchers of the Max Planck Institute for Social Anthropology (Halle/Saale). 

Last but not least, I owe my deep thanks to the German Academic Exchange Service 
(DAAD) that provided financial support for this project. The reported study was also 
funded by the Russian Foundation for Basic Research (RFBR) and Deutsche Forschun- 
gsgemeinschaft (DFG), project number no. 21-512-12002 HHHO Prognostic Methods and 
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1 Theshortcoming ofthe definition of 'Russian' is also due to the fact that most West- 
ern European languages do not have the differentiation between the concepts of rossi- 
jskij and russkij appeared during the perestroika, where the first denotes citizenship, 
and the second means ethnicity (along with the term ethnicity in the Russian scientific 
tradition, the term nationality is often used). 


2 Inthe Soviet and especially in the Russian context, all non-Russian languages fall 
under the definition of ‘minority languages’ related to the languages of those ethnic 
groups that represent a demographic minority in the country, and languages whose 
functions and scope of use are limited compared to the official language of this region 
(in this case, Russian). 
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I am interested in another linguistic aspect of post-Soviet migra- 
tion, that is how language ideologies manifest themselves in the mul- 
tilingual strategies and practices of the multiethnic body of migrants 
from the FSU, regulating their language attitudes and language prac- 
tices. This problem is studied in this article in connection with the 
concept of translanguaging, which is becoming increasingly popular 
in modern studies of multilingualism, language contacts and migra- 
tion. Translanguaging means “the ability of multilingual speakers to 
shuttle between languages, treating the diverse languages that form 
their repertoire as an integrated system” (Canagarajah 2011, 401). 

This problem statement stems from my observations made during 
a field study conducted in 2016 within the framework of a project Lan- 
guage and Ethnic Identity of Non-Russian Immigrants from the For- 
mer Soviet Union in Western Europe. The aim of the project was to 
examine processes of negotiation, (re)construction and transforma- 
tion of ethnic identity and its connection with language and culture 
among non-Russian migrants from the FSU. This multiethnic body of 
immigrants shared some common characteristics that were essential 
for my research aim and made them an interesting and little-studied 
research object: they were united by a common history, (Soviet) cul- 
ture and (Russian) language, but at the same time represented other 
ethnicities and cultures. From the linguistic point of view, each eth- 
nic group was multilingual and had at least three languages in their 
linguistic repertoire: the ethnic language (L1) (sometimes only at a 
symbolic level), the Russian language (L2) and the host country lan- 
guage/languages (L3-n). 

The project aim did not include the study of translanguaging, but 
during the interview and participant observation I noticed a very lim- 
ited use of the migrants' multilingual repertoire, lack of language 
creativity and switching between languages - what is usually defined 
as translanguaging. This fact was particularly striking against the 
background of quite frequent translanguaging strategies and prac- 
tices of migrants in different regions of the world (Lee 2014; Li 2018; 
Pennycook 2008). I assume that this is due to language ideologies 
acquired in childhood that continue to guide (to a certain extent) 
the speech behaviour of migrants whose acculturation took place 
in the FSU. Therefore, the goal of this article is to examine multi- 
lingual practices of first-generation migrants from the FSU from a 
translanguaging perspective aiming to reveal and analyse translin- 
gual resources (or their absence) used by this multilingual and mul- 
ticultural group and to uncover language ideologies underpinning 
these practices. 

This research falls within the frame of critical ethnographic so- 
ciolinguistics. At the beginning of the article, the theoretical frame- 
work of this study is considered, namely concepts of translanguaging, 
code switching and language ideologies (paragraph 2). Then, I will 


119 
Balcania et Slavia e-ISSN 2785-3187 


1, 1, 2021, 117-148 


Erzhen Khilkhanova 
Language Ideologies and Multilingual Practices of Post-Soviet Migrants 


provide methodological considerations related to the characteristics 
of the social research object and research materials (paragraph 3).I 
will then examine translingual practices of post-Soviet migrants in 
their oral speech and informal written communication in social net- 
works (paragraph 4). The next section is devoted to the description 
and analysis of several language ideologies in the historical context 
of the Soviet language policy and in the modern context of migrants' 
life in Western Europe. Finally, the summary of the results of this re- 
search is provided and conclusions are drawn. 


2 Theoretical Framework: Concepts of Translanguaging, 
Code Switching and Language Ideologies 


Language ideologies are the systems of ideas about social and lin- 
guistic relationships and how they relate to social values (Woolard 
2003; Kroskrity 2000, 5). They have an affective dimension and are 
related to what Cavanaugh calls a "social aesthetics of language" 
(2009, 194-5). They are also defined as "the cultural conceptions 
of the nature, form, and purpose of language, and of communica- 
tive behavior as an enactment of the collective order" (Gal, Wool- 
ard 1995, 30). 

Sociolinguists distinguish many ideologies related to the use or 
non-use of ethnic, or minority languages. These include folklorisa- 
tion, hypertraditionalisation, and the association of the language 
only with the past and old times (Sallabank, Marquis 2018; Woolard 
2020). Many ideologies revolve around the concepts of legitimacy, 
‘language ownership’, and authenticity usually going hand in hand 
with language purism (Pischlóger 2016). In real life, these ideologies 
are intertwined with each other and their action is carried out in a 
complex way; often contradictory ideologies coexist in the minds of 
the same people. 

In the context of my research, it is important to identify and show 
how language ideologies can act as an internal regulator of people's 
linguistic behaviour. Language ideologies are also closely related 
to translanguaging, since they are the source of linguistic varia- 
bility and the driving force for the creation of new hybrid ‘languag- 
es’ by people who are on the border of two or more ‘worlds’. This is 
why translingual strategies are so common in the migration envi- 
ronment. Migrants invent a new language to reflect their inner real- 
ity determined by their migration experience and destabilised iden- 
tities (Hüppauf 2004). 

The theory of translanguaging, which originates within the frame- 
work of pedagogy, has long gone beyond its framework and is active- 
ly used in modern studies of multilingualism and language contacts. 
Today, linguists are already saying that it is necessary to distinguish 
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between the pedagogical use of translanguaging and spontaneous 
translanguaging, the latter referring to the complex discursive prac- 
tices of bilinguals (see Garcia 2009). In addition to the well-known 
definition by Canagarajah given above, translanguaging is also de- 
fined as a smooth synergetic transition from one linguistic culture 
to another, as a result of which there is some merging of them, while 
there is no complete assimilation and the linguistic and cultural iden- 
tity of language users is preserved, and a mixed discourse is creat- 
ed (Canagarajah 2002). Translanguaging implies the permeability of 
languages, their mutual influence, as a result of which a new quali- 
ty of an enriched linguistic culture arises. The translingual area is 
characterised by blurriness, indistinctness, uncertainty, “linguistic 
fluidity” (Lee 2014). In language studies and in language pedagogy, 
the translingual approach focuses not on language as a system, but 
on language as a practice, i.e. activity (Lee 2014, 305). 

It should be emphasised that translanguaging is the use of a speak- 
er's entire communicative repertoire, as a result of which languages 
and cultures smoothly flow into each other, sometimes helping, and 
sometimes hindering (Pennycook 2008). Code switching and code 
mixing are considered one of the translingual strategies (Pennycook 
2008, 30.4), or as an instance of translanguaging, alongside other 
bilingual phenomena such as translation, borrowing, and additional 
processes, in a range of modalities (MacSwan 2017, 191). 

Thus, translanguaging is, firstly, a broader phenomenon including 
code switching (hereafter CS) and code mixing (hereafter CM). Sec- 
ondly, the more fundamental difference between them is the paradig- 
matic difference. The translanguaging concept belongs to the post- 
colonial paradigm (see Pennycook 2006), while the notions of CS and 
CM appeared in earlier studies of multilingualism and language con- 
tacts. This causes a difference in approaches: the terms CS and CM 
are based on the vision of languages as separate systems. According- 
ly, speech, where there is a mixing and switching of codes, was of- 
ten assessed rather as defective speech, since the reasons for these 
phenomena were seen in the speaker's insufficient linguistic compe- 
tence in any ofthe languages involved, or even in both. This is the ba- 
sis for the designations of such mixed speech as "macaronic" (Bert- 


3 The term code switching is used here as a term with a broader meaning, which in- 
cludes codes mixing (see also Myers-Scotton 1993, 1; Gumperz 1982, 59). The prefer- 
ence for the term code switching is also due to considerations of brevity and conveni- 
ence. At the same time, I understand by code mixing the process of using two (or more) 
codes (languages) in speech, which has acquired a more or less regular form and has 
gone further than just code switching. But the most important differentiating criterion 
is grammatical: if the morphosyntactic rules of both languages are not violated, this is 
code switching. If the lexemes acquire morphological indicators of another language, 
this is codes mixing. In other words, as long as the morphosyntactic basis of a particu- 
lar code is preserved, we can talk about CS. If not, about CM. 
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agaev 1969, 127), “mestizo language”: in relation to the FSU, these 
characteristics were often used to describe speech, where Russian 
and other languages of the peoples of the USSR were mixed. A pu- 
ristic, normative approach to language is clearly seen behind these 
epithets. The translanguaging approach is fundamentally different 
and positive. Translanguaging 


has a pronounced active and lingvocreative character, representing 
a new, creative, transformative and integrative use of all bilingual 
resources. Additional opportunities for the lingvocreative activi- 
ty of bilinguals arise precisely in the ‘indefinite zone’ on the bor- 
der of languages. (Proshina 2017, 161; see also MacSwan 2017, 190) 


Since the main difference lies precisely in the approach, in the as- 
sessment of translanguaging and CS phenomena, I believe that a re- 
searcher can, however, work with the terminological apparatus tra- 
ditionally used in CS research, as it is done, for example, in research 
by Karpava et al. 2019. Otherwise, the scholars remain both without 
a terminological apparatus, and, as J. MacSwan rightly notes, with- 
out empirical case and are left only with an ideological one that de- 
nies languages as social and political constructs (MacSwan 2017, 
169, 177). I generally support MacSwan's multilingual perspective on 
translanguaging, which acknowledges the existence of discrete lan- 
guages and multilingualism reconciling educational (that is, trans- 
languaging) and linguistic research. The reconciliation becomes 
possible through distinguishing mental grammars from linguistic 
repertoires, arguing that bilinguals, like monolinguals, have a sin- 
gle linguistic repertoire but a richly diverse mental grammar (Mac- 
Swan 2017, 167). 

Since, speaking of translanguaging, we talk primarily about in- 
dividual bi- and multilingualism,^ a look at what motivates people 
to switch and mix codes sheds additional light on the mechanism of 
translanguaging. Factors that contribute to the dynamic use of two 
or multiple languages in a particular instance can be broadly divid- 
ed into three groups - external, internal, and linguistic: 


1. external factors, i.e. independent of a speaker (such as histo- 
ry, politics or demography); 

2. internal factors, i.e. attached to a speaker, both as an individ- 
ual and as a group member (such as psycholinguistic, prag- 
matic, sense of ethnic identity); 


4 Although, as J. MacSwan rightly notes, a language is the product of a communi- 
ty of speakers, a collection of overlapping individual languages (MacSwan 2017, 174). 
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3. linguistic factors (such as lack of a concept in a minority lan- 
guage-L1, insufficient linguistic competence of speakers in L1). 


Although all the factors are intertwined and act in a complex way in 
a real language contact situation, translanguaging and CS from the 
point of view of this scheme are motivated primarily by internal fac- 
tors. External or linguistic factors also contribute to "multiple dis- 
cursive practices in which bilinguals engage in order to make sense 
of their bilingual worlds" (García (2009, 45), but in a more indirect 
way. In some way, only unconscious or, vice versa, conscious, pur- 
poseful cases of CS driven from within can be considered truly trans- 
lingual strategies. The first ones are valuable for sociolinguistics be- 
cause 'automatic' CSs, when a speaker would like to, but cannot speak 
without switching the code, or when a mixed code has become a kind 
of norm for the speaker (or for the entire language community) are 
the true indicators of real processes occurring in the contacting lan- 
guages. Automatic 'slips of the tongue' in another language can be 
caused by such psycholinguistic factors as (i) insufficient linguistic 
competence in L1 combined with higher linguistic competence in a 
majority language (L2); (ii) the habit of speaking one of the languag- 
es from the bilingual repertoire (a stereotype of speaking behaviour); 
(iii) linguistic economy, or economy of speaking efforts. 

Vice versa, a deliberate, goal-oriented CS discovers various speak- 
ers' intentions: for instance, the translingual strategy of mixing lan- 
guages can be chosen consciously - as a special slang - to construct 
one's group identity, which also includes ethnicity (Moustaoui, Pre- 
go, Zas 2019). Vivid examples of ‘protest’ motivation can be found 
in translingual/transcultural literature created by bi- or polylingual 
writers. This is, for instance, the book by the German-Turkish author 
Feridun Zaimogly Kanak Sprak. The book's language is the language 
of protest, violence, obscenity, 'dirty' metaphors and slang, but it is 
also some kind of artificial language containing words and expres- 
sions that are not present in either German or Turkish (Yildiz 2004). 
Similarly developed and used is the pidginised German of 'Russian 
Germans' called 'kvelya' and 'legitimised' by Vladimir Kaminer's 
book Russendisko (Kaminer 2000; Kirilina 2011). My research, how- 
ever, focuses not on professional creative writing 'at the crossroads 
of cultures' but on 'ordinary' multilingual migrant speakers and their 
translanguaging practices through which they represent their expe- 
riences and relationships in their new localities. 

As already mentioned, translanguaging is an integral feature of 
increasing migration flows in the modern world. In the sociolinguis- 
tic literature, quite well studied are the 'big' languages involved in 
the translingual process in a migration situation. These are, for ex- 
ample, new transnational forms on the border between German and 
Turkish languages (Zaimogly 2004), translanguaging practices in the 
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home among bilingual/multilingual Russian-speaking children and 
their parents in Cyprus, Sweden and Estonia (Karpava, Ringblom, 
Zabrodskaja 2019). But the most widespread translingual forms are 
(for obvious reasons) the so-called World Englishes (including Rung- 
lish, Spanglish, Chinglish etc.) (Li 2018; Pennycook 2008). From the 
point of view of translanguaging theory, these combined variants that 
have arisen as a result of cultural and linguistic contact and carry 
the features of both English and other languages (sometimes serv- 
ing as a means of communication for entire countries) are not defec- 
tive speech, but specific translingual formations. 

With the approaches, definitions and theoretical discussions pre- 
sented above in mind, let us now turn to the methodological consid- 
erations and the description of data collection procedures. 


3 Methodological Considerations 


The data include a corpus of spoken materials, collected as part of 
the field research conducted in the period from July to October 2016 
in four Western European countries (Germany, France, Netherlands, 
and Great Britain with the focus on Germany).° A total of twenty-six 
in-depth, semi-structured interviews lasting from one to several hours 
were collected. The interviews were conducted, recorded and tran- 
scribed by the author in person. Apart from the interviews, the data 
were collected employing extensive ethnographic observation. From 
an ethnic point of view, the pool of informants included Buryats, Ya- 
kuts and Kazakhs from two countries, i.e. Russia (twenty-four peo- 
ple) and Kazakhstan (two people). Of course, there is a bias towards 
the Russian area, which is explained both by the complexity of data 
collection among migrants and by a rather short field study duration. 
Therefore, the conclusions are made mainly on the Russian material, 
although the commonality of background and linguistic strategies of 
post-Soviet migrants are very much alike, as evidenced in other stud- 
ies on post-Soviet migration (Kasatkina 2011). Interviews typically last- 
ed one to two, sometimes more hours, and were usually held in pub- 
lic venues, such as restaurants, local businesses, and private homes. 
The diagrams below [figs 1-2] show two important characteristics 
of the sample: age and the fields of professional activity/status of in- 
formants. I divided the study participants into four age groups, and 


5 Allsubjects gave their informed consent for inclusion before they participated in the 
study. Transcripts are fully anonymised, with each participant being ascribed a pseu- 
donym. Any information that could reveal the informants’ identity such as names of lo- 
calities, places of study etc. is removed from the published interview fragments. In- 
formation about the countries of residence is, however, retained, since it is impossible 
to determine the participants' identity from it, and yet this is important for analysis. 


124 


Balcania et Slavia e-ISSN 2785-3187 
1,1,2021, 117-148 


Erzhen Khilkhanova 
Language Ideologies and Multilingual Practices of Post-Soviet Migrants 


12 


w 


> 25 years 26-35 years 36-45 years 46> years 
9 
7 
| | 
| | 
| i i E RE 
student finance / consulting translation / editing 


Figure l Agecharacteristics of informants 
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we can see that the most numerous (almost half of the respondents) 
was the group of migrants, whose ages ranged from 26 to 35 years. 

The informants' educational level and social status are reflected in 
their fields of activity [fig. 2]. Note that almost all participants worked 
or studied, except for three who were housewives at that time. Fig- 
ure 2 shows that the majority of respondents work in the field of ed- 
ucation and science (3296) or are college students (2396), mainly en- 
rolled in master programmes. 
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The interview toolkit consisted of (1) questions identifying the re- 
spondents' social characteristics (age, sex, marital status, number of 
years spent in Western European countries, information on education 
both in the home country and the host countries, professional back- 
ground, migration status as well as reports on the ethnic self-identi- 
fication of the participants), (2) questions aimed at identifying the in- 
teraction of language, culture and ethnic identity, (3) questions that 
solicited participants' reports on their language proficiency in Rus- 
sian, ethnic language and L3-n, and the use of these languages in the 
migration context. In addition, attention was paid to the social com- 
munication circle of post-Soviet migrants, i.e. social networks. An im- 
portant goal was to find out how social networks (connections) of mi- 
grants affect their language practices and, vice versa, what this case 
adds to the already known models of interaction between the struc- 
ture of social networks and the use of languages in migrant commu- 
nities. Therefore, a part of the interview questionnaire focused spe- 
cifically on eliciting information on participants' networks and the 
links between the personal social networks and language use. 

Since the original purpose of the study was not related to trans- 
languaging, I drew attention to the almost complete absence of a dy- 
namic and functionally integrated use of different languages in the 
participants' speech later. Therefore, for a deeper study of this topic 
I decided to study samples of spontaneous speech, since it is in spon- 
taneous speech that the creativity of multilingual speakers can man- 
ifest itself to the fullest (see also García 2009). Interviews, despite 
being held in a relaxed atmosphere and being close to a natural con- 
versation, cannot by definition represent completely free speaking. 
Therefore, the field research materials were subsequently supple- 
mented with data obtained from the analysis of virtual social com- 
munities of several migrant groups from post-Soviet Russia. Social 
media such as Facebook are a popular object of research on translin- 
gual practices (Halim, Maros 2014), because they represent 


a new, delocalized, multimodal space positioned on the bounda- 
ry between orality and literacy in which translingual practices 
emerge spontaneously. (Moustaoui, Prego, Zas 2019, 3) 


This method called 'virtual ethnography' was carried out on posts 
and informal discussions in social network groups of Russian-speak- 
ing migrants of non-Russian ethnicity residing in different western 
countries. 

So, generally this study is based on a variety of methods to obtain 
and analyse empirical data, namely interviews and participant ob- 
servations as central ones, in conjunction with a virtual ethnography 
of online interactions of Buryat, Yakut and Kalmyk migrants living 
in western countries and the sociolinguistic analysis of online social 
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media. Six social network groups in Facebook and Instagram became 
the object of my study. In total, I analysed the written speech prac- 
tices of the following online communities: Kalmyks in USA / Kalmy- 
ki v SShA, which included 3,000 members at the time of writing this 
part of the article (June 2021), sakhadiaspora.official in Instagram 
with 2,695 subscribers and Sakha Diaspora community organisation 
in Facebook (171 followers), and three Buryat groups in Facebook 
(Vstrechi v Evrope; Buryat House, USA; and Buryat connection UK 
with total 1,600 members). All social media were viewed in the peri- 
od from 2016 to June 2021, with the exception of the Sakha Diaspora 
Facebook page, which was created on December 11, 2020. 

Using virtual ethnography poses additional ethical considerations, 
although most Internet communication is considered public due to 
its free availability. Like in other studies employing data from social 
media (Kliuchnikova 2016, 71), my approach was also to avoid us- 
ing any material that could not be accessed freely, without the spe- 
cial permission of the author. Nevertheless, I still do not provide the 
full (nick)names of the writers I quote. Only the initials, names of 
the participant's group and the dates of publications are indicated. 

For the comparative analysis of modern data with historically ear- 
lier information about the speech practices of non-Russian migrants 
from the Soviet Union, the paper employs secondary sources, i.e. in- 
terviews with Oirat-Kalmyks living in the United States. The inter- 
views were published in the book by E. Guchinova who studied the 
history and ethnicity of the Kalmyk community in the USA in 1997- 
98 (Guchinova 2004). 

It should be noted that the pool of informants is limited by two 
important and related characteristics: the high level of education of 
the studied migrants and the legality of migration channels. Almost 
all interviewed informants had a higher education (24 people), seven 
of them possessed PhD and Candidate of Sciences degree, and two 
people have finished professional (vocational) schools. The high lev- 
el of education reflects the general picture of post-Soviet migration, 
which is closely related to the problem of ‘brain drain’ from Russia. 
For example, in the United States the 2000 census recorded a very 
high level of education among the Russians who arrived: more than 
half of those over 25 years old (52%) had a bachelor's and master's 
degree. Only one in four Americans had an equivalent level of educa- 
tion (24%) (Denisenko 2013). In the early 2000s, 43% of Russian mi- 
grants in OECD countries aged 15 years and over had higher educa- 
tion. For immigrants from most other countries, including the OECD, 
this indicator was lower (Dumont, Lemaître 2005). 

Thus, the studied group represents a certain segment in the gen- 
eral emigration flow from FSU countries - namely intellectual mi- 
gration. Currently, the concept of 'intellectual migration' mainly 
describes the processes of departure of scientists and qualified per- 
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sonnel to work outside their country. But there is no single mean- 
ingful definition of the phenomenon of intellectual migration in the 
scientific discourse. The broadest interpretation implies that intel- 
lectual migrants are all professionally qualified persons who have 
been abroad for more than one year (Ledeneva 2014, 107). This broad 
interpretation of intellectual migration is adopted in this paper. Ac- 
cordingly, the characteristics and behavioural strategies of Buryat, 
Kazakh and Yakut migrants are determined by belonging to this cat- 
egory of migrants and, most likely, cannot be extrapolated to other 
groups, for example, unskilled migrants or illegal labour migration. 

The next limitation was due to my ignorance of the Kazakh and 
Yakut languages, which also excluded the possibility of Kazakh-Rus- 
sian and Yakut-Russian CS and CM. However, considering that there 
were 22 Buryat migrants among the Russian participants, the pos- 
sibility of communicative practice that “blurs or breaks through ap- 
parent boundaries” (Blackledge, Creese 2016, 2) between Russian 
and Buryat remained open but was rarely noticed by the researcher 
during ethnographic observation and interviews. It should be noted 
that the possibility of translingual transitions between Buryat and 
Russian was initially excluded for seven Buryat informants who did 
not know the Buryat language (there were no such people among 
the Yakut and Kazakh informants). Despite my attempts to conduct 
a conversation in the Buryat language with Buryat informants, most 
of them could not maintain a lengthy conversation in it. As one of the 
participants put it, "I'm not as good [in Buryat] as you are” (Vlada, 
32, Germany). So, the main language of the interview and communi- 
cation beyond it was Russian. 

In addition, all informants had a good (in many cases native-like) 
command in at least one foreign language, primarily English, but also 
German, French etc. They knew that the interviewer was also fluent 
in English and German, so they had a full opportunity to dynamical- 
ly use at least these languages along Russian. However, despite their 
multilingual past and present, the speakers remained mostly within 
the monolingual Russian discourse, although the cases of translan- 
guaging were also present. They are discussed in the following section. 


4  Translanguaging Practices of Post-Soviet 
Non-Russian Migrants 


In this section, I consider translingual transitions and creative, play- 
ful usage (or the lack thereof) of languages by migrants in oral speech 
(recorded during interviews and ethnographic observation) and in- 
formal written communication in social networks. The data obtained 
contain not so many examples when the speakers collaboratively en- 
gage in fluid, plurilingual communication practices. 
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The recorded cases of translanguaging were mostly insertions of 
single lexemes (sometimes longer speech segments) from L1 and L3-n 
into the predominantly Russian discourse. I interpret insertions as 
incidental cases of CS (see also Myers-Scotton 1993, 163). The iden- 
tified CSs can be grouped into the following categories: 


1. non-equivalence vocabulary (when describing cultural-specific re- 
alities) or words for which the speaker could not immediately find a 
translation equivalent or did not want to spend speaking efforts on it:° 


A camoe-To uHTepecHoe 65/0, YTO eru uHorga npueaxaM, n 3axonnunn BOT 
Tak 3a MOpor, ja, Vl Mbl, 3HAWUT, c HAMM 340poBaemca, A B 6ypaTckom Bcerga 
6bina, 6ypAag g3r3nT3Ù, ryTan, manrarali xy ymgeg. (Densema, 56, Germany) 
And the most interesting thing was that children sometimes came, and went 
like this over the threshold, yes, and we, then, say hello to them, | was always in 
Buryat, in the Buryat clothes, shoes, cap and all on. 


ALYONA Syuycb ceñyac Ha Arzthelferin 
INTERVIEWER Ho ato He bachelor's? 
ALYONA Her. Mpocto Ausbildung. CpegHee (Alyona, 24, Germany) 


ALYONA lam currently studying to become medical assistant. 
INTERVIEWER But it's not bachelor's? 
ALYONA No. Just training. Vocational’. 


Mendid Kalmyks! In celebration of the year of the Gaha, there will be a Tsagan 
Dance next month in Philly. ALL PROCEEDS WILL BENEFIT THE PHILLY HURUL! 
(Facebook, Kalmyks in USA / Kanmbiku B CLLIA, T., 6 January 2019) 

Hello Kalmyks! In celebration of the Pig year, there will be a White Dance next 
month in Philly. 

ALL PROCEEDS WILL BENEFIT THE PHILLY TEMPLE! 


2. Clichéd etiquette formulas (for example, greetings): 


OHu 3axogsr, A roBopro: “Cavin 6aííua! 33 OporTbi Jaa, 433W raparTbl", n NOTOM 
Ha pycckoM. Tak eru HM 6ypaTcKoro, HU pycckoro He 3HaloT! 

(Densema, 56, Germany) 

They come in, I say: “Hello! Well, come in, come right in”, and then in Russian. So 
the children do not know neither Buryat nor Russian! 


XanbMry&, MeHgyT! KTO B CLUA, no6agnsürecb! /JaBaitre o6ujarbcs 
(Facebook, Kalmyks in USA / Kanmbiku B CLA, A., 20 March 2019) 
Kalmyks, hello! Who is in the USA, please add! Let's communicate’ 


6 Transcripts’ legend: italics for ethnic language (L1); standard font for Russian lan- 
guage (L2); underlined for English, German (L3-n); bold for mixed translingual forms. Ex- 
amples from social networks are given completely in the authors' design. I only removed 
non-verbal symbols (such as emoticons) as uninformative in the context of this study. 
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3. Quotation: 


... M A BAPyYT rpe-ro Tam B Kay Hana cruxorBopenue “WMH HIOJ3H Xapa 
moùhoH w3Hru...” (Densema, 56, Germany) 

... and I suddenly found a poem “your eyes are like black currants” somewhere 
inthe closet... 


Ouu roBopaT: Sie sind doch hier geboren, T.e. oHN ce68 YyBCTBYIOT ce68 KAK 
poma. (Sara, 25, Netherlands) 
They say: “They are born here”, that is, they feel at home’. 


4. Communication topic: 


Mory 6ypsagap nyyrapxa (Radzana, 48, Germany) 
‘I can speak Buryat’ 


yxara 3k-3ukHp! [axe BaMmepukaHcKoù pealbHOCTM Hao NONbITATECA CAeNaTb 
ManeHbKMÍ War HaBCTpeuy Ka/IMbILIKOMy A3bIKy! 

(Facebook, Kalmyks in USA / Kanmbiku B CLA, D., 14 August 2020) 

Smart moms and dads! Even in the American reality, itis necessary to try to take 
a small step towards the Kalmyk language! 


5. Externally unmotivated intersentential CSs: 


VU Tam a gBa roga B Wkony xoguna, Tunx343 hypryynnmHa 6ypsag 6alira xanm. 
(Densema, 56, Germany) 
‘And there | went to school for two years, at that time the school was Buryat’ 


CniyWauTe go koHua. CeprH, cepuxarH xXanbmrygm. llpocbinaürecb, 
npo6yxganrecb Kanmbiku!” (Facebook, Kalmyks in USA / KanMbiKM B CLA, E., 
11 December 2020) 

‘Listen to the end. Wake up, wake up Kalmyks. Wake up, wake up Kalmyks!’ 


6. Externally unmotivated intrasentential CSs: 


Tyr Moxeuib cecrb B pecropaH PaHAOMHO n y Te65 3a cocen HAM CTONOM 6ygyT 
cugetb pycckue, B na6e 6ynyr pycckue, T.e. YyBCTBOBaTb homesick - Hery 
3pecb. (Kira, 28, Great Britain) 

*Here you can sit in a restaurant randomly and you will have Russians at the 
next table. There will be Russians in the pub, that is, to feel homesick - there is 
no such thing here' 


MuHyc B TOM YTO NOCTOAHHO prejudice. llocrosHHo misunderstanding, Bce 
nena. (Sara, 25, Netherlands) 

*The disadvantage is thatthere is always prejudice. Constantly misunderstand- 
ing, all that stuff’ 


7 Here the translation from Kalmyk into Russian is given by the author of this post. 
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Kakue desired outcomes? (Dari, 32, Switzerland) 
‘What are desired outcomes?' 


He ro uro6bs! oHa [aHrnuùuckaa KkyNbTypa] TaK APKO BbipaxeHa, OHA [10BO/IbHO 
plain no cpaBHeHuto c apyrumu KynbTypamu. (Dari, 32, Switzerland) 

‘It’s not that it [English culture] is so pronounced, it's quite plain compared to 
other cultures' 


EcTb npegnoxeHne, cKuHyTbca no £15 c KaXXJ]oro B3pOC/IOTO M HAKpbiTb CAMUM 
CTON, à camoe rnaBHoe Hanenutb as much as we want Hawn bYY3bI. (Facebook, 
Buryat connection UK, D., 23 February 2017) 

‘There is an offer to chip in £15 from each adult and cover the table ourselves, 
and most importantly, to make as much as we want our BUUZAS’. 


7. Language creativity, language game: 


A TbI XaMaryMcT? :) (Facebook, Buryat connection UK, B., 12 October 2020) 
Are you a khamaguist (*don't-care-at-all loafer’)? 


8. Discourse markers: 


M Bor ceituac, CMoTpa, na, O6paTHo, also Te nBa roga koropbie s B [mentions the 
name of the village] npoBena, 6binu CaMbIMM T9XXe/IbIMM B MOEÙ xu3HM. (‘Radza- 
na, 48, Germany") 

‘And now, looking back, yes, well those two years that | spent in [mentions the 
name ofthe village] were the hardest in my life' 


When looking at these CS examples, we see that they occur under the 
influence of external, internal and linguistic factors, which were dis- 
cussed in section 2. Quotations and communication topic (examples 3 
and 4) are well-known and frequent linguistic factors of CS (see, e.g., 
Gumperz 1982). Similarly, a person has to resort to the resources of 
another language while referring to the realities of the correspond- 
ing culture and using clichéd etiquette formulas, as in examples 1 and 
2 (for more detail, see Khilkhanova 2009). As J. Fishman explained it, 


Thatlanguage which has traditionally been linked with a given eth- 
noculture is, at any time during which that linkage is still intact, 
best able to name the artefacts and to formulate or express the in- 
terests, values and world-views of that culture. (Fishman 1991, 20) 


However, as stated in section 2, primarily unmotivated, unconscious 
(examples 5, 6, 8) or, on the contrary, conscious, purposeful CSs (ex- 
ample 7) can be considered truly translingual strategies. For instance, 
in example 7 the popular Buryat expression "khama ugy" ('all the 
same, no care, no difference") is played with. The addition of the pro- 
ductive Russian suffix -ist creates an occasionalism with the meaning 
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‘a person who doesn't care’ (in Russian there is a similar slang equiv- 
alent, pofigist, formed according to the same model from the non-lit- 
erary word pofig). The morphosyntactic basis of the phrase "Are you 
a khamaguist?" is provided by the Russian language, serving here 
as a matrix language (Myers-Scotton 1993). In the examples of intra- 
sentential switching, we see that the speakers use different strate- 
gies, sometimes mixing, sometimes switching codes. Among the inser- 
tions, forms without matrix language affixes are quite frequent - the 
so-called bare forms (Myers-Scotton 2006): uysBcrBosBar» (homesick); 
NOCTOAHHO (prejudice); rocrosmno (misunderstanding); oHa JOBONBHO 
(plain). The frequent use of bare forms is noteworthy, because they 
do not violate the grammar of CS. Other cases, such as the word 'ran- 
domly’, where the English root morpheme is supplied with the affix of 
the matrix language - the Russian adverbial suffix -o - are very rare. 

The example with a German discursive marker 'also' deserves a 
separate comment. Discursive markers (hereinafter - DM) in general 
and in bilingual speech in particular are a separate topic that I can 
only touch on briefly here. As a rule, DMs perform modal, deictic and 
metalinguistic functions. In this example, we are dealing with the 
speaker's automatic choice of the DM ‘also’, which happens when lan- 
guage units are more active in the speaker's cognitive base due to the 
more frequent use. In this case, objective, propositional information 
is expressed in Russian, while subjective, evaluative, additional infor- 
mation is expressed using DM from German. Perhaps this is the first 
step towards understanding, evaluating, and commenting on reali- 
ty in another language, i.e. one of the first links in the mechanism of 
language shift (from Russian to German) by this particular speaker. 

It is noteworthy that there are more creative translingual prac- 
tices and language games in the spontaneous online communication 
of migrants from the FSU. Nevertheless, the analysis of social net- 
works shows that the language of communication there is mainly Rus- 
sian, especially in Kalmyk and Buryat social networks. The situation 
with the Yakut online communication is somewhat different. There 
are more instances of spontaneous Yakut speech in posts, responses 
and comments to them, for example: 


Ckopo [enb aKyTcKux onanyuiek! Kyugy Caxa CupuH onoxTooxTopo, 60/I50ffyH! 
Myyc ycrap 7 kyHyr3p Anrbicraax AJIAAJIbbI kyH3 62nu2T3H3p 6yonna. 

By kyHH3 MaaHbi aJlaaJbblÓbITbIH acraaH aliMaxTapObITbIH, JOROTTOPOYTYH bIHbI- 
paH aMcaraóbir, kyHaynyy6yr. 

ATbIH rpynnanapra rapsar, 643p. 

l'oroBbre a/iaagbbi #AKyTCKMeonagbu, ny6nukyUte BUHctarpamu He 3a6ypbre 
nocraBHrTb xa3uirer #anaagbblkyH3! 


The Day of Yakut pancakes is coming soon! Dear Sakha residents, attention! 
April 7 is the Day ofthe Blessed Pancakes. 
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On this day, we fry pancakes and invite and give to try pancakes, we honour our 
relatives and friends with pancakes. 

Report on other groups. 

Prepare alaadyi #yakutskieoladyi, post on Instagram and do not forget to put 
the hashtag #alaadyikune! 


The activity of the Yakut online and offline community, related to the 
relatively good state of the Yakut language, is a well-known fact for 
Turkologists and sociolinguists. According to comparative statistics 
on the presence of minority languages of the Russian Federation in 
the global network (Map of Languages 2016), the Yakut language is 
best represented online along with Bashkir, Udmurt and Tatar. These 
are the languages with the greatest linguistic vitality among all mi- 
nority languages of Russia. This indicates that the Internet does not 
create a new reality, it only reflects the current situation with mul- 
tilingualism in the country. Languages that are safe without the In- 
ternet, with a high level of language activism and national conscious- 
ness of native speakers, are widely used in the global network as well. 

One could assume that few cases of translanguaging in interviews 
between Russian and other languages are due to the fact that the in- 
terview is not a suitable place where personal linguistic creativity 
can be deployed. However, examples of other studies where people 
easily switch between different languages during interviews refute 
this assumption. For comparison, here is a fragment from the inter- 
view with a Kalmyk emigrant of the second emigration wave? taken 
from the book by E. Guchinova who studied the Kalmyk diaspora in 
the United States (Guchinova 2004, 168): 


lnaBHas pa3Huua - TO, YTO Te Ka/IMBIKM CYNTAHT, YTO under KOMMyHM3M 6aacH 
H some kind tunm aura religion Tum ioMra... XOTA TeHg 6binu divided n 6binu 
TAKNE KoTOpble geUcTBUTENbHO against penurus, Ho 6onblunHeTBo 6binu n 
coxpaHanu rep. Tep KomMyHucTuYeckuù pexuw arra xu 6arag ... abolish 
KeukcH - TUNT caHHaB. BOT tunm suggestion. M 6n ctapanca follow 3BpaHb 
parents-uHb. A He ckaxy uro 6bin best cryneur, Ho average... shag. M 3Knag, 
Turag rep maHa old generation Ogpxapanb ogag cyphynbg oHn 6binu mang real 
mother. TegHa noBegeHue mang noHpaBunocb. OHu nokazann Ham knowledge, 
intelligence n KHUXKC Ayyar, n MHE 3TO NoHpaBunocb. TegHa behaviar magHg in- 
fluance kehag mbi noexann Agpxuiop. KoHeuHo, MaHa ypoBeHb knowledge-a nk 
6uui 6una, HO BCE XE Mbl MOTAN uuTaTb, apUdMeTHK... Hac BCe Ke NpuHann B 
30-M rogy B Kanmbiukuù negaroruNecKkuù TeXHAKyM. 


The main difference isthatthose Kalmyks believe that being under communism 
and some kind of such anti-religion such things... although there were divided 
and there were those who were really against religion, but most were and re- 


8 The second wave of Kalmyk emigration occurred during the period of the construc- 
tion of socialism, Stalin's repressions and the Second World War. 
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tained that. That communist regime having existed for seventy years... was abol- 
ished - so! think. Here is such suggestion. And / tried to follow my parents-in’. 
| will not say that | was the best student, but like the average... And to begin, 
then that our old generation went to study in Astrakhan, they were real moth- 
er for us. Their behaviour was liked by us. They showed us knowledge, intelli- 
gence and reading books aloud, and | loved it. Their behaviour made influence 
on us, we went to Astrakhan. Of course, our level of knowledge-a was not high, 
but still we could read, arithmetic... We were still accepted in the year of 1930 
in the Kalmyk Pedagogical College. 


This fragment, which is typical in the speech practices of non-Rus- 
sian migrants of previous waves of the Russian emigration (at least 
the first and second waves), is an example of a translingual discourse 
that the author has rarely encountered not only in the migrant en- 
vironment, but also in Russia. In this fragment, all three languag- 
es are almost equally involved: L1 (Kalmyk, in italics), L2 (Russian, 
standard font) and L3 (in this case, English, underlined). It presents 
a striking contrast to the speech of post-Soviet migrants (especial- 
ly well-educated ones), which is an example of 'pure', grammatically 
correct and unmixed Russian speech. Despite the fact that constant 
bilingualism is often considered to create a specific context that is 
particularly conducive to developing linguistic creativity (Kellman 
2000), speakers remain in a predominantly monolingual Russian- 
speaking mode both online and offline, even in 'ethnic' social net- 
works and when communicating with co-ethnics. 

In general, my data confirmed that for first-generation migrants 
from the FSU the Russian language plays the role of a collective uni- 
fier of the immigrant community, and its pragmatic value is similar 
to that in the USSR. At the same time, the data obtained indicate the 
heterogeneity of speech practices and language competencies of the 
studied migrants' group. For the majority, however, fluid, transling- 
ual practices employing the rich repertoire of their multilingual re- 
sources are not typical, which is especially noticeable in compari- 
son with the speech of representatives of the previous ‘waves’ of the 
multilingual and multiethnic part of the Russian emigration. I be- 
lieve that this is largely due to linguistic ideologies, which will be 
discussed in the next section. 


5 Language Ideologies at Work 


Language ideologies, like any mental formations, are difficult to 
study due to their implicit representation in speech and often hid- 
den character for the speakers themselves. Sometimes they can be 
identified based on specific statements, sometimes only by analys- 
ing speech behaviour regarding the presence or, on the contrary, 
absence of relevant linguistic phenomena. Addressing language ide- 
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ologies presents a new turn in my research, that is I empirically ar- 
rived at this concept when trying to explain the rarity of transling- 
ual speech practices by the studied migrant group. I will begin my 
analysis with one piece of interview that is indicative in this regard:? 


INTERVIEWER What about the language? 

NORA Buryat? No, [mentions the name of town] is a Russian village anyway. 
More Russians live and... my parents did not speak Buryat with me. They didn't 
want people to make fun of me with my accent. They were ridiculed in Soviet 
times. Well, | myself no longer... did not try, or something, to learn the language. 
In principle, if | really wanted to, | could have learned it, but... but when every- 
one around me speaks Russian [...]. But when! came here, | missed it a little, and 
I’m a little ashamed, because | don’t know it. Because everyone here asks me 
first of all: do you know your language? When people ask me where | am from, | 
say:lam from Russia, from Buryatia. Where's that? I say: well, there ... between 
Lake Baikal and Mongolia. Or they say to me for example: you don't look like a 
Russian. | say: well, we live there on the border with Mongolia. Oh, do you speak 
Mongolian? No, don't. Well, it's a little embarrassing, of course. When | moved 
here, for some reason | wanted to learn Buryat even more than when | was in 
Buryatia. (Nora, 30, Germany) 


This fragment reflects both objective and subjective factors that de- 
termined the language situation and speech behaviour of people in 
the Soviet period through the prism of the migrant's personal per- 
ception. One of the objective factors is demography - by 1989, Rus- 
sians made up 49.396 of the total population of the Soviet Union (All- 
Union Population Census 1989) and 79.896 in the Russian Federation 
eleven years after the collapse of the USSR (All-Russian Population 
Census 2002). Subjective factors refer to people's perception and han- 
dling the language inequality in the Soviet Union reflected in Nora's 
brief narrative. Traumatic experience and internalized ideological 
views about the status, hierarchy and pragmatic values of Russian 
and minority languages resulted in a certain linguistic behaviour, 
when the older generation did not pass on the ethnic languages to 
children and encouraged their study of the Russian language. In this 
way, they wanted to save their children from the negative experience 
they themselves went through and to ensure their social success. In 
that way, the ‘monolingual ideology', i.e. Russian monolingualism, 
was formed in the mass consciousness of the Soviet people, Russians 
and non-Russians alike. As A. Burykin put it: "Bilingualism in gen- 
eral is not characteristic of the Russian language mentality. At the 
level of everyday consciousness, the command and use of other eth- 
nic groups' languages is not encouraged and not welcomed by carri- 


9 Sincethere are no cases of translanguaging here, the fragment is given in English 
translation at once. 
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ers of this mentality” (Burykin 2004a; see also Baranova, Fedorova 
2018). This statement is confirmed statistically as well: only 0.6% of 
ethnic Russians spoke regional languages or the languages of the in- 
digenous peoples of Russia, and the tendency of Russians to remain 
monolingual continues (Mikhalchenko, Trushkova 2003; Kharitonov, 
Stepina 2020). Speaking in a non-Russian language in the presence 
of Russian speakers was also considered ‘indecent’ in almost all re- 
gions of the former USSR (Burykin 2004b, 27). It is also a manifes- 
tation of ‘monolingual ideology’ in communicative behaviour. Oth- 
er studies have also repeatedly stated that during the Soviet years, 
people were reprimanded or given pejorative assessments in cities 
because they spoke non-Russian languages in public places (Fergu- 
son 2016b, 99; Khilkhanova 2020). 

However, the above fragment reflects not only the ideology of 
(Russian) monolingualism. Nora also speaks about the shame for 
not knowing the native language and about her wish to learn Bury- 
at “even more than when I was in Buryatia”. Although such feelings 
and the desire to ‘regain’ an ethnic language rarely result in con- 
crete actions to learn it and the majority of migrants remain with the 
same knowledge (or ignorance) of ethnic language with which they 
went abroad, their language attitudes, however, change. It can be al- 
so seen in another interview fragment: 


fl CTONbKO pà3HbIX A3bIKOB 3Ha10, a cBOero pogjHoro He 3Hato. losroMy missing 
something, ollyljeHue roTepaz. (Sara, 25, Netherlands) 

| know so many different languages, but | don’t know my native one. Therefore, 
missing something, a sense of loss. 


This quote illustrates that abroad, in a foreign cultural environment, 
the ethnic awareness of migrants raises, including a sense of the val- 
ue of native language as an important component of identity. In ad- 
dition, the Soviet ideas of language hierarchy and association of the 
Russian language with progress and civilisation, and other languag- 
es with backwardness and an uncivilised way of life are erased in the 
minds of migrants. Such changes in the perception and evaluation of 
languages are also determined by the migrants’ new European lo- 
calities. I mean that Western Europe is the birthplace of a new ideo- 
logical paradigm built on the recognition of multilingual and multi- 
cultural values (Smokotin 2010, 4). This ideological turn was caused 
by the liberalisation of economic, political and social life in Western 
countries after the Second World War. In the USSR at that time, on 
the contrary, the early Soviet policy of supporting multilingualism 
was replaced by the policy of promoting the Russian language as the 
language of interethnic communication, so that it became the lin- 
gua franca of the entire country. Of course, even in Western Europe, 
the liberal approach is not universally recognised: there are enough 
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examples of preserving ‘old’ ideologies such as ‘one nation-one lan- 
guage’, ‘language-of-the-past’ (in relation to minority languages), and 
many others (Sallabank, Marquis 2018; Toivanen 2015). Neverthe- 
less, in general, the liberation from language bias and the increased 
interest and pride in their language and culture are noted by many 
interview participants. This indicates that language ideologies as dy- 
namic entities strongly depend on the social context. 

The next interview fragment illustrates the attitude of a study par- 
ticipant to her CS between Russian and German: 


ABuenoM ecnu 6parb, si yyBCTByIo ce68 cso6ogHee, Kora Ha DyCCKOM, NOTOMY 
UTO A nyuuie Mory cpopMynupoBarb CBON (ppa3br n BbIPa3UTbCA TouHee. To, YTO 
nepekmtoyalocb - 3To neua/ibHo. (Edita, 26, Germany) 

And in general, if you take it, | feel freer in Russian, because | can better formu- 
late my phrases and express myself more accurately. The factthat switch is sad. 


We see that the informant assesses her CS as a disadvantage. In my 
view, this statement manifests the normative idea about language 
purity shared by many Russians. Purism and prescriptive language 
ideologies are very characteristic of the language culture of Russian 
society. As K. Pischlóger put it, 


Although language purism or 'prescriptivism' is not restricted to 
Russia, Russia belongs to a 'standard language culture' [...] in 
which there [...] is the view that one variety of language has an in- 
herently higher value than others, and that this ought to be im- 
posed on the whole of the speech community [...]. This attitude is 
true particularly for the Russian language, but these ideas and 
standards are transferred by philologists and speakers of Udmurt 
and other minority languages in Russia to their own language(s). 
(Pischlóger 2016, 112) 


These prescriptive and puristic attitudes have led to an almost ex- 
clusive scholarly interest in the standardised modern literary lan- 
guage (114) and in the 'high' spheres such as fiction, which is also 
explained by the authority of classical Russian literature. Therefore, 
‘low’ spheres - such as the language of mass communication, spo- 
ken language and obscene vocabulary - have become objects of lin- 
guistic research in Russia only since the last decades of the twenti- 
eth century. 

In many respects, such normative language attitudes originate in 
the Russian school where the normative approach to language dom- 
inates. The mode of duty, prescriptions, as well as 'orthographocen- 
trism' are the most important features of Russian linguistic think- 
ing (Golev 2002). 
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The desire to ‘abstain from mistakes’ taken to its farthest limit 
becomes a brake on the speech development, which should lead 
to free, creative expression of thought (it is enough to recall the 
advice of experienced teachers and tutors to their pupils to avoid 
phrases in the unmistakable spelling of which they are not sure). 
(Golev 2002, 190) 


This is true both in relation to Russian and foreign languages. In So- 
viet times, the grammar-based method was dominant in foreign lan- 
guage teaching at Russian schools and universities. This resulted in 
grammatically correct speech, on the one hand, and in the fear of 
speaking out of fear of making mistakes, on the other. 

Therefore, it is perhaps not surprising that translingual freedom 
and creative treatment of languages, by definition violating their 
grammatical correctness and ‘purity’, are not widespread in the mass 
language consciousness of Russian residents. Of course, this is not an 
absolute statement, and the lingvocreative treatment of the Russian 
language is a frequent phenomenon both among professionals and 
lay people. Constant CS between Russian and minority languages is 
also widespread in Russia. However, for a large number of people, 
especially educated ones, correctness is absolutised, dogmatised, 
and sacralised (Golev 2002, 184), and language purism is one of the 
most common language ideologies in Russia. This explains the nu- 
merous debates about the ‘corruption’ of the Russian language under 
the influence of English. One of the Russian politicians V. Zhirinovs- 
ky expressed this mass concern proposing to do everything to “stop 
the Russians from using borrowed words and replace them with na- 
tive Russian words” (Boguslavskaya, Kitanina 2016, 12). The prev- 
alence of puristic ideologies in relation not only to Russian, but al- 
so towards minority languages was noted, apart from K. Pischlóger, 
by researchers of Sakha-Russian bilingualism (Ferguson 2016a), of 
Ukrainian and Russian languages (Bilaniuk 2005). 

In the migration context, similar results were obtained by Kliuch- 
nikova (2016) who studied linguistic practices, language attitudes 
and discourses on language(s) of Russian-speaking migrants in the 
North-East of England. She states that despite her initial assump- 
tions, in her fieldwork she found very little evidence of any ‘ludic’ 
initiatives among the group under study, at least in the way that 
would explicitly operate both sets of linguistic resources - English 
and Russian. Kluchnikova explains rare linguistic games among Rus- 
sian-speaking migrants by several reasons including low numbers, 
irregular contacts and general unpreparedness for creative experi- 
ments. However, as one ofthe cornerstones for maintaining linguistic 
practices, the author names the set of attitudes towards language(s) 
and normative linguistic behaviour by bilingual speakers themselves 
(Kliuchnikova 2016, 277). The most influential language attitude, in 
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Kluchnikova's view, is Russian speakers’ admiration of highly stand- 
ardised, ‘classical’, literary norm of Russian that dominates over any 
urge to explore bilingual limits. The sense of belonging to the ‘Rus- 
sian-speaking continuum’ through shared norms and practices leaves 
little space for bilingual manoeuvres on the margins. Rare counterex- 
amples illustrate the point providing a scarce contrast to wider cre- 
ative passivity and inertness (Kliuchnikova 2016, 271). 


6 Results and Discussion 


This research examined the speech practices of a well-educated mul- 
tiethnic and multilingual group of migrants from the former USSR 
in Western Europe from the translanguaging perspective. The field 
study participants were a heterogeneous group not only in terms of 
ethnicity, but also in terms of linguistic competence in L1, L2 and 
L3-n. Russian was the native language for one third of Buryat mi- 
grants who did not speak their ethnic language. At the same time, all 
migrants’ Russian language proficiency was native or native-like. Al- 
so, all of them had high linguistic competence in foreign languages, 
primarily in English. Thus, the possibility of translanguaging (includ- 
ing CS as one of the translingual strategies or as an instance of trans- 
languaging,) between Russian and foreign languages was open to all 
research participants, and between L1 and L2, for two-thirds of them. 
Nevertheless, the analysis of interviews, ethnographic observation 
data and the language of migrant social networks revealed few trans- 
lingual strategies and practices; in fact, instead of translanguaging, 
both conscious and unconscious language choices in favour of Rus- 
sian were made. There are many reasons for this: external, internal 
and linguistic. Among the external ones are low numbers of Buryat, 
Kazakh and Yakut migrants, irregular contacts with co-ethnics and 
dispersed living of these small groups’ members. However, the main 
reason for replacing translanguaging with linguistic assimilation is, in 
my opinion, that first-generation migrants from the FSU countries are 
products of the Soviet language policy. Therefore, in addition to ‘ex- 
porting’ their linguistic repertoire, they also export the relationships 
between L1 and L2 that have been established in their home country. 
Both in the country of origin and abroad, L2 (Russian) satisfies the 
need for mutual understanding, being the ‘language of interethnic 
communication’ of the multiethnic group of migrants from the FSU. 
In post-Soviet Russia, language relations between the state (Rus- 
sian) language and more than 150 other languages have not changed 
much. The short period of the ‘parade of sovereignties’ and “mobi- 
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lised linguicism’'° in the nineties brought some changes: in almost all 
national republics of the Russian Federation, laws on languages and 
the concepts for the development of titular languages were adopted, 
and minority languages were introduced in school education. How- 
ever, already shortly afterwards, the state returned to the Russian- 
dominated ethnolinguistic policy; as a result, the language shift to- 
wards Russian continues among most minority groups in Russia (see 
also Mikhal'chenko 2019). 

In addition to the ideas of language hierarchy and the pragmatic 
value of languages ingrained in the consciousness of people, language 
purism and the (Russian) ‘monolingual ideology’ are also widespread 
in the mass consciousness of Soviet people and Russian citizens. Of 
course, the ‘monolingual ideology' was primarily part of the state lan- 
guage policy, but, like with all ideologies, the mechanism of its forma- 
tion and functioning is reciprocal. By reciprocity I mean that, on the 
one hand, this and other ideologies regulate people's social behaviour 
and dictate how they should think and behave. On the other hand, the 
language ideologies were implemented not only ‘from above’, but also 
'from below', driven by ideological views about languages and their val- 
ues internalised by the Soviet people. Since the language policy of Rus- 
sia largely reproduces the Soviet one, as mentioned above, these lan- 
guage ideologies are also common among citizens of modern Russia. 

Reciprocity is also manifested in the fact that language ideolo- 
gies influence social and political reality and, in turn, are formed 
under the influence of situational, institutional and social contexts. 
Migration to Western European countries - where, according to in- 
formants, there is the rule of law and democratic values, non-dis- 
crimination, equality of career opportunities and conditions for self- 
realisation - changes language ideologies towards greater liberality 
and tolerance. This proves once again that language ideologies are 
dynamic entities and strongly depend on the social climate and soci- 
etal expectations. Similar changes in language ideologies are taking 
place in Russia, but with a delay of several decades. 

All this explains the migrants' reluctance or lack of need to cross 
the language boundaries and to create hybrid forms. The above-de- 


10 The 'parade of sovereignties' was a series of declarations of sovereignty of vari- 
ous degrees by the constituent republics of the Soviet Union from the late 1980s to the 
early 1990s. It followed a loosened power grip of the Soviet Communist Party as a re- 
sult of the (democratisation) policy of the perestroika period. These declarations chal- 
lenged the priority of the all-Union legislation over the republican ones (which started 
the so-called ‘war of laws’) and took actions to strengthen the republics’ economic in- 
dependencies. The 'parade of sovereignties' ended with the collapse of the USSR (Sint- 
sov, Bityutskiy 2019). ‘Mobilised linguicism' refers to those movements in the repub- 
lics the USSR consisted of, which ended with the adoption of laws proclaiming the lan- 
guages of the peoples (nationalities), after which the republics were named, as their 
state languages (Guboglo 1998). 
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scribed language ideologies block the potential translanguaging that 
is based on the recognition of all languages as equal. Or vice ver- 
sa, translanguaging can also be driven by directly opposite motives: 
awareness of linguistic inequality and protest against it encourage a 
person to provocation, aggression and creation of artificial languag- 
es', as in the mentioned book by Zaimogly, Kanak Sprak. In the ab- 
sence of such awareness and protest, a person, especially one who 
has passed all levels of the Russian-language education system, au- 
tomatically reproduces the monolingual patterns of 'pure' speaking 
in a more prestigious and more frequently used language. 

The control of consciousness that prohibits making mistakes and 
the normative linguistic behaviour are also due to the fact that all 
highly educated migrants grew up reading classical Russian litera- 
ture** and absorbed the values of Russian ‘standard language cul- 
ture'. Thus, the general cultural level of the migrants also does not 
contribute to non-literary, profane, provocative treatment of languag- 
es. The little 'urge to explore bilingual limits' and to dynamically 
move between languages applies to both minority and foreign lan- 
guages in contact with Russian. 

To conclude, I hope that empirical findings and theoretical insights 
generated from the research presented in this paper can improve our 
understanding of translanguaging and language ideologies, particu- 
larly in relation to the FSU region. Hopefully, it also adds to an ex- 
isting body of research on the Russian-speaking 'ethnic' migration 
and contributes to the disciplinary fields of sociolinguistics, linguis- 
tic and cultural anthropology. 


11 Other ethnic literatures are not taught in Russian schools and universities, except 
for a few so-called 'national' faculties and schools. 
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